RECOMBINANT IMMUNOGLOBULIN PREPARATIONS, METHODS
FOR THEIR PREPARATION, DNA SEQUENCES, EXPRESSION
VECTORS AND RECOMBINANT HOST CELLS THEREFOR

Recombinant DNA techniques are used both to produce
immunoglobulins which are analogous to those normally
found in vertebrate systems and to take advantage of these
gene modification techniques to construct chimeric or other
modified forms.


Background of the Invention 

This invention relates to the field of immunoglobulin production
and to modification of naturally occurring immunoglobulin amino acid
sequences. Specifically, the invention relates to using recombinant
techniques both to produce immunoglobulins which are analogous to
those normally found in vertebrate systems and to take advantage of
these gene modification techniques to construct chimeric or other
modified forms.

A. Immunoglobulins and Antibodies 

Antibodies are specific immunoglobulin polypeptides produced by 
the vertebrate immune system in response to challenge by foreign 
proteins, glycoproteins, cells, or other antigenic foreign 
substances. The sequence of events which permits the organism to 
overcome invasion by foreign cells or to rid the system of foreign 
substances is at least partially understood. An important part of 
this process is the manufacture of antibodies which bind 
specifically to a particular foreign substance. The binding 
specificity of such polypeptides to a particular antigen is highly 
refined, and the multitude of specificities capable of being 
generated by the individual vertebrate is remarkable in its 
complexity and variability. Thousands of antigens are capable of
eliciting responses, each almost exclusively directed to the
particular antigen which elicited it.

Immunoglobulins include both antibodies, as above described, and 
analogous protein substances which lack antigen specificity. The 
latter are produced at low levels by the lymph system and in
increased levels by myelomas. 

A.1 Source and Utility

Two major sources of vertebrate antibodies are presently
utilized: generation in situ by the mammalian B lymphocytes and in
cell culture by B-cell hybrids. Antibodies are made in situ as a
result of the differentiation of immature B lymphocytes into plasma
cells, which occurs in response to stimulation by specific
antigens. In the undifferentiated B cell, the portions of DNA
coding for the various regions on the immunoglobulin chains are
separated in the genomic DNA. The sequences are reassembled
sequentially prior to transcription. A review of this process has
been given by Gough, Trends in Biochem. Sci., 6: 203 (1981). The
resulting rearranged genome is capable of expression in the mature B 
lymphocyte to produce the desired antibody. Even when only a single 
antigen is introduced into the sphere of the immune system for a 
particular mammal, however, a uniform population of antibodies does 
not result. The in situ immune response to any particular antigen 
is defined by the mosaic of responses to the various determinants 
which are present on the antigen. Each subset of homologous 
antibody is contributed by a single population of B cells—hence in 
situ generation of antibodies is "polyclonal". 

This limited but inherent heterogeneity has been overcome in
numerous particular cases by use of hybridoma technology to create
"monoclonal" antibodies (Kohler, et al., Eur. J. Immunol., 6: 511
(1976)). In this process, splenocytes or lymphocytes from a mammal
which has been injected with antigen are fused with a tumor cell
line, thus producing hybrid cells or "hybridomas" which are both
immortal and capable of producing the genetically coded antibody of
the B cell. The hybrids thus formed are segregated into single
genetic strains by selection, dilution, and regrowth, and each
strain thus represents a single genetic line. They therefore
produce immunoreactive antibodies against a desired antigen which
are assured to be homogeneous, and which antibodies, referencing
their pure genetic parentage, are called "monoclonal". Hybridoma
technology has to this time been focused largely on the fusion of
murine lines, but human-human hybridomas (Olsson, L., et al., Proc.
Natl. Acad. Sci. (USA), 77: 5429 (1980)), human-murine hybridomas
(Schlom, J., et al., ibid., 77: 6841 (1980)) and several other
xenogenic hybrid combinations have been prepared as well.
Alternatively, primary, antibody producing, B cells have been
immortalized in vitro by transformation with viral DNA.

Polyclonal, or, much more preferably, monoclonal, antibodies 
have a variety of useful properties similar to those of the present 
invention. For example, they can be used as specific
immunoprecipitating reagents to detect the presence of the antigen
which elicited the initial processing of the B cell genome by
coupling this antigen-antibody reaction with suitable detection
techniques such as labeling with radioisotopes or with enzymes
capable of assay (RIA, EMIT, and ELISA). Antibodies are thus the
foundation of immunodiagnostic tests for many antigenic
substances. In another important use, antibodies can be directly
injected into subjects suffering from an attack by a substance or
organism containing the antigen in question to combat this attack.
This process is currently in its experimental stages, but its
potential is clearly seen. Third, whole body diagnosis and
treatment is made possible because injected antibodies are directed
to specific target disease tissues, and thus can be used either to
determine the presence of the disease by carrying with them a
suitable label, or to attack the diseased tissue by carrying a
suitable drug.

Monoclonal antibodies produced by hybridomas, while
theoretically effective as suggested above and clearly preferable to
polyclonal antibodies because of their specificity, suffer from
certain disadvantages. First, they tend to be contaminated with
other proteins and cellular materials of hybridoma (and, therefore,
mammalian) origin. These cells contain additional materials,
notably nucleic acid fragments, but protein fragments as well, which
are capable of enhancing, causing, or mediating carcinogenic
responses. Second, hybridoma lines producing monoclonal antibodies
tend to be unstable and may alter the structure of antibody produced
or stop producing antibody altogether (Kohler, G., et al., Proc.
Natl. Acad. Sci. (USA), 77: 2197 (1980); Morrison, S.L., J. Immunol.,
123: 793 (1979)). The cell line genome appears to alter itself in
response to stimuli whose nature is not currently known, and this
alteration may result in production of incorrect sequences. Third,
both hybridoma and B cells inevitably produce certain antibodies in
glycosylated form (Melchers, F., Biochemistry, 10: 653 (1971))
which, under some circumstances, may be undesirable. Fourth,
production of both monoclonal and polyclonal antibodies is
relatively expensive. Fifth, and perhaps most important, production
by current techniques (either by hybridoma or by B cell response)
does not permit manipulation of the genome so as to produce
antibodies with more effective design components than those normally
elicited in response to antigens from the mature B cell in situ.

The antibodies of the present invention do not suffer from the
foregoing drawbacks, and, furthermore, offer the opportunity to
provide molecules of superior design.

Even those immunoglobulins which lack the specificity of 
antibodies are useful, although over a smaller spectrum of potential 
uses than the antibodies themselves. In presently understood
applications, such immunoglobulins are helpful in protein
replacement therapy for globulin related anemia. In this context,
an inability to bind to antigen is in fact helpful, as the
therapeutic value of these proteins would be impaired by such
functionality. At present, such non-specific antibodies are
derivable in quantity only from myeloma cell cultures suitably
induced. The present invention offers an alternative, more
economical source. It also offers the opportunity of cancelling out
specificity by manipulating the four chains of the tetramer
separately.

A.2 General Structure Characteristics 

The basic immunoglobulin structural unit in vertebrate systems is
now well understood (Edelman, G.M., Ann. N.Y. Acad. Sci., 190: 5
(1971)). The units are composed of two identical light polypeptide
chains of molecular weight approximately 23,000 daltons, and two
identical heavy chains of molecular weight 53,000 - 70,000. The
four chains are joined by disulfide bonds in a "Y" configuration
wherein the light chains bracket the heavy chains starting at the
mouth of the Y and continuing through the divergent region as shown
in Figure 1. The "branch" portion, as there indicated, is
designated the Fab region. Heavy chains are classified as gamma,
mu, alpha, delta, or epsilon, with some subclasses among them, and
the nature of this chain, as it has a long constant region,
determines the "class" of the antibody as IgG, IgM, IgA, IgD, or
IgE. Light chains are classified as either kappa or lambda. Each
heavy chain class can be prepared with either kappa or lambda light
chain. The light and heavy chains are covalently bonded to each
other, and the "tail" portions of the two heavy chains are bonded to
each other by covalent disulfide linkages when the immunoglobulins
are generated either by hybridomas or by B cells. However, if
non-covalent association of the chains can be effected in the
correct geometry, the aggregate will still be capable of reaction
with antigen, or of utility as a protein supplement as a
non-specific immunoglobulin.
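
As a rough arithmetic check on the chain weights quoted above, the following sketch sums the approximate weights of two light and two heavy chains for one four-chain unit. The constant and function names are illustrative assumptions and do not appear in the specification; the figures ignore glycosylation and any other modification.

```python
# Approximate chain weights in daltons, as quoted in the text above.
LIGHT_CHAIN_DA = 23_000
HEAVY_CHAIN_RANGE_DA = (53_000, 70_000)


def tetramer_mass_range(light=LIGHT_CHAIN_DA, heavy_range=HEAVY_CHAIN_RANGE_DA):
    """Return (low, high) approximate mass of a two-light, two-heavy unit."""
    low_heavy, high_heavy = heavy_range
    return (2 * light + 2 * low_heavy, 2 * light + 2 * high_heavy)


low, high = tetramer_mass_range()
print(f"Approximate tetramer mass: {low:,} to {high:,} daltons")
```

On these figures the four-chain unit falls between roughly 152,000 and 186,000 daltons, the spread coming entirely from the heavy chain class.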

The amino acid sequence runs from the N-terminal end at the top 
of the Y to the C-terminal end at the bottom of each chain. At the 
N-terminal end is a variable region which is specific for the 
antigen which elicited it, and is approximately 100 amino acids in 
length, there being slight variations between light and heavy chain 
and from antibody to antibody. The variable region is linked in 
each chain to a constant region which extends the remaining length
of the chain. Linkage is seen, at the genomic level, as occurring
through a linking sequence known currently as the "J" region in the
light chain gene, which encodes about 12 amino acids, and as a
combination of "D" region and "J" region in the heavy chain gene,
which together encode approximately 25 amino acids.

The remaining portions of the chain are referred to as constant 
regions and within a particular class do not vary with the
specificity of the antibody (i.e., the antigen eliciting it). 

As stated above, there are five known major classes of constant 
regions which determine the class of the immunoglobulin molecule 
(IgG, IgM, IgA, IgD, and IgE corresponding to γ, μ, α, δ, and ε
heavy chain constant regions). The constant region or class
determines subsequent effector function of the antibody, including
activation of complement (Kabat, E.A., Structural Concepts in
Immunology and Immunochemistry, 2nd Ed., pp. 413-436, Holt, Rinehart,
Winston (1976)), and other cellular responses (Andrews, D.W.,
et al., Clinical Immunobiology, pp. 1-18, W.B. Saunders (1980); Kohl,
S., et al., Immunology, 48: 187 (1983)); while the variable region
determines the antigen with which it will react.
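
The correspondence between heavy chain type and immunoglobulin class stated above can be written as a simple lookup. This is only an illustrative sketch; the dictionary and function names are assumptions introduced here, and subclasses are deliberately left out.

```python
# Heavy chain type -> immunoglobulin class, as listed in the text.
HEAVY_CHAIN_TO_CLASS = {
    "gamma": "IgG",
    "mu": "IgM",
    "alpha": "IgA",
    "delta": "IgD",
    "epsilon": "IgE",
}
LIGHT_CHAIN_TYPES = ("kappa", "lambda")


def immunoglobulin_class(heavy_chain):
    """Look up the class determined by the heavy chain constant region."""
    return HEAVY_CHAIN_TO_CLASS[heavy_chain.lower()]


def possible_pairings():
    """List (class, light chain) pairs; each class may carry either light chain."""
    return [
        (ig_class, light)
        for ig_class in HEAVY_CHAIN_TO_CLASS.values()
        for light in LIGHT_CHAIN_TYPES
    ]


print(immunoglobulin_class("mu"))
print(len(possible_pairings()))
```

The light chain type does not enter the class lookup at all, which mirrors the statement that the class is fixed by the heavy chain constant region.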

B. Recombinant DNA Technology 

Recombinant DNA technology has reached sufficient sophistication
that it includes a repertoire of techniques for cloning and
expression of gene sequences. Various DNA sequences can be
recombined with some facility, creating new DNA entities capable of
producing heterologous protein product in transformed microbes and
cell cultures. The general means and methods for the in vitro
ligation of various blunt ended or "sticky" ended fragments of DNA,
for producing expression vectors, and for transforming organisms are
now in hand.

DNA recombination of the essential elements (i.e., an origin of 
replication, one or more phenotypic selection characteristics, 
expression control sequence, heterologous gene insert and remainder 
vector) generally is performed outside the host cell. The resulting 
recombinant replicable expression vector, or plasmid, is introduced 
into cells by transformation and large quantities of the recombinant
vehicle are obtained by growing the transformant. Where the gene is
properly inserted with reference to portions which govern the
transcription and translation of the encoded DNA message, the
resulting expression vector is useful to produce the polypeptide
sequence for which the inserted gene codes, a process referred to as
"expression." The resulting product may be obtained by lysis, if
necessary, of the host cell and recovery of the product by 
appropriate purifications from other proteins. 
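
The list of elements recombined into an expression vector can be treated as a checklist. The sketch below is a hypothetical bookkeeping aid only: the class name, field names, and placeholder values are assumptions made for illustration, and it models no actual DNA sequence or cloning step.

```python
from dataclasses import dataclass, field


@dataclass
class VectorDesign:
    """Hypothetical checklist record for an expression vector design."""
    origin_of_replication: str = ""
    selection_markers: list = field(default_factory=list)
    expression_control_sequence: str = ""
    gene_insert: str = ""

    def missing_elements(self):
        """Return names of the listed elements that are still unset."""
        missing = []
        if not self.origin_of_replication:
            missing.append("origin of replication")
        if not self.selection_markers:
            missing.append("phenotypic selection characteristic")
        if not self.expression_control_sequence:
            missing.append("expression control sequence")
        if not self.gene_insert:
            missing.append("heterologous gene insert")
        return missing


# Placeholder values; none of these strings is a real sequence.
design = VectorDesign(
    origin_of_replication="ori (placeholder)",
    selection_markers=["tetracycline resistance"],
    gene_insert="kappa chain cDNA (placeholder)",
)
print(design.missing_elements())
```

Here the design lacks only an expression control sequence, so that is the single item reported.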

In practice, the use of recombinant DNA technology can express 
entirely heterologous polypeptides—so-called direct expression—or 
alternatively may express a heterologous polypeptide fused to a 
portion of the amino acid sequence of a homologous polypeptide. In 
the latter cases, the intended bioactive product is sometimes 
rendered bioinactive within the fused, homologous/heterologous 
polypeptide until it is cleaved in an extracellular environment. 

The art of maintaining cell or tissue cultures as well as 
microbial systems for studying genetics and cell physiology is well
established. Means and methods are available for maintaining
permanent cell lines, prepared by successive serial transfers from
isolated cells. For use in research, such cell lines are maintained
on a solid support in liquid medium, or by growth in suspension 
containing support nutriments. Scale-up for large preparations 
seems to pose only mechanical problems. 

Summary of the Invention 

The invention relates to antibodies and to non-specific 
immunoglobulins (NSIs) formed by recombinant techniques using 
suitable host cell cultures. These antibodies and NSIs can be 
readily prepared in pure "monoclonal" form. They can be manipulated 
at the genomic level to produce chimeras of variants which draw 
their homology from species which differ from each other. They can 
also be manipulated at the protein level, since all four chains do 
not need to be produced by the same cell. Thus, there are a number 
of "types" of immunoglobulins encompassed by the invention. 

First, immunoglobulins, particularly antibodies, are produced
using recombinant techniques which mimic the amino acid sequence of
naturally occurring antibodies produced by either mammalian B cells
in situ, or by B cells fused with suitable immortalizing tumor
lines, i.e., hybridomas. Second, the methods of this invention
produce, and the invention is directed to, immunoglobulins which
comprise polypeptides not hitherto found associated with each other
in nature. Such reassembly is particularly useful in producing
"hybrid" antibodies capable of binding more than one antigen; and in
producing "composite" immunoglobulins wherein heavy and light chains
of different origins essentially damp out specificity. Third, by
genetic manipulation, "chimeric" antibodies can be formed wherein, 
for example, the variable regions correspond to the amino acid 
sequence from one mammalian model system, whereas the constant 
region mimics the amino acid sequence of another. Again, the 
derivation of these two mimicked sequences may be from different
species. Fourth, also by genetic manipulation, "altered" antibodies
with improved specificity and other characteristics can be formed.




Two other types of immunoglobulin-like moieties may be 
produced: "univalent" antibodies, which are useful as homing
carriers to target tissues, and "Fab proteins" which include only
the "Fab" region of an immunoglobulin molecule, i.e., the branches of
the "Y". These univalent antibodies and Fab fragments may also be
"mammalian", i.e., mimic mammalian amino acid sequences; novel
assemblies of mammalian chains, or chimeric, where for example, the 
constant and variable sequence patterns may be of different origin. 
Finally, either the light chain or heavy chain alone, or portions 
thereof, produced by recombinant techniques are included in the 
invention and may be mammalian or chimeric. 


In other aspects, the invention is directed to DNA which encodes
the aforementioned NSIs, antibodies, and portions thereof, as well
as expression vectors or plasmids capable of effecting the 
production of such immunoglobulins in suitable host cells. It 
includes the host cells and cell cultures which result from 
transformation with these vectors. Finally, the invention is 
directed to methods of producing these NSIs and antibodies, and the 
DNA sequences, plasmids, and transformed cells intermediate to them. 

Brief Description of the Drawings

Figure 1 is a representation of the general structure of
immunoglobulins.

Figure 2 shows the detailed sequence of the cDNA insert of pK17G4
which encodes kappa anti CEA chain.

Figure 3 shows the coding sequence of the fragment shown in Figure
2, along with the corresponding amino acid sequence.

Figure 4 shows the combined detailed sequence of the cDNA inserts of
pγ298 and pγ11 which encode gamma anti CEA chain.

Figure 5 shows the corresponding amino acid sequence encoded by the
fragment in Figure 4.

Figures 6 and 7 outline the construction of expression vectors for
kappa and gamma anti-CEA chains respectively.

Figures 8A, 8B, and 8C show the results of sizing gels run on
extracts of E. coli expressing the genes for gamma chain, kappa
chain, and both kappa and gamma chains respectively.

Figure 9 shows the results of western blots of extracts of cells
transformed as those in Figures 8.

Figure 10 shows a standard curve for ELISA assay of anti CEA
activity.

Figures 11 and 12 show the construction of a plasmid for expression
of the gene encoding a chimeric heavy chain.

Figure 13 shows the construction of a plasmid for expression of the
gene encoding the Fab region of heavy chain.

Detailed Description 
A. Definitions 

As used herein, "antibodies" refers to tetramers or aggregates
thereof which have specific immunoreactive activity, comprising
light and heavy chains usually aggregated in the "Y" configuration
of Figure 1, with or without covalent linkage between them;
"immunoglobulins" refers to such assemblies whether or not specific
immunoreactive activity is a property. "Non-specific
immunoglobulin" ("NSI") means those immunoglobulins which do not
possess specificity, i.e., those which are not antibodies.

"Mammalian antibodies" refers to antibodies wherein the amino
acid sequences of the chains are homologous with those sequences
found in antibodies produced by mammalian systems, either in situ
or in hybridomas. These antibodies mimic antibodies which are
otherwise capable of being generated, although in impure form, in
these traditional systems.


"Hybrid antibodies" refers to antibodies wherein chains are
separately homologous with referenced mammalian antibody chains and
represent novel assemblies of them, so that two different antigens
are precipitable by the tetramer. In hybrid antibodies, one pair of
heavy and light chain is homologous to antibodies raised against one
antigen, while the other pair of heavy and light chain is homologous
to those raised against another antigen. This results in the
property of "divalence", i.e., ability to bind two antigens
simultaneously. Such hybrids may, of course, also be formed using
chimeric chains, as set forth below.

"Composite" immunoglobulins means those wherein the heavy and
light chains mimic those of different species origins or
specificities, and the resultant is thus likely to be a non-specific
immunoglobulin (NSI), i.e., lacking in antibody character.


"Chimeric antibodies" refers to those antibodies wherein one
portion of each of the amino acid sequences of heavy and light
chains is homologous to corresponding sequences in antibodies
derived from a particular species or belonging to a particular
class, while the remaining segment of the chains is homologous to
corresponding sequences in another. Typically, in these chimeric
antibodies, the variable region of both light and heavy chains
mimics the variable regions of antibodies derived from one species
of mammals, while the constant portions are homologous to the
sequences in antibodies derived from another. One clear advantage
to such chimeric forms is that, for example, the variable regions
can conveniently be derived from presently known sources using
readily available hybridomas or B cells from non-human host
organisms in combination with constant regions derived from, for
example, human cell preparations. While the variable region has the
advantage of ease of preparation, and the specificity is not
affected by its source, the constant region, being human, is less
likely to elicit an immune response from a human subject when the
antibodies are injected than would the constant region from a
non-human source.
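
The typical chimeric arrangement just described, a variable region mimicking one species joined to a constant region mimicking another, can be modelled as a small data structure. This is a minimal sketch under stated assumptions: the class names, the `is_chimeric` test, and the six-letter residue strings are all invented for illustration and are not sequences from the specification. It also fixes the fusion point at the variable/constant boundary, which is only one of the possible cases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    kind: str      # "variable" or "constant"
    source: str    # species (or class) the sequence is homologous to
    sequence: str  # placeholder amino acid string, not a real sequence


@dataclass(frozen=True)
class Chain:
    variable: Region
    constant: Region

    @property
    def sequence(self):
        # N-terminal variable region followed by the constant region.
        return self.variable.sequence + self.constant.sequence

    @property
    def is_chimeric(self):
        # Simplification: chimeric here means the two regions differ in source.
        return self.variable.source != self.constant.source


murine_v = Region("variable", "mouse", "QVQLQQ")  # made-up residues
human_c = Region("constant", "human", "ASTKGP")   # made-up residues
chain = Chain(variable=murine_v, constant=human_c)
print(chain.is_chimeric, chain.sequence)
```

A chain built from a human variable and a human constant region would report `is_chimeric` as false under this simplified test.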


However, the definition is not limited to this particular 
example. It includes any antibody in which either or both of the 
heavy or light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different sources, whether 
these sources be differing classes, differing antigen responses, or 
differing species of origin and whether or not the fusion point is 
at the variable/constant boundary. Thus, it is possible to produce 
antibodies in which neither the constant nor the variable region
mimics known antibody sequences. It then becomes possible, for
example, to construct antibodies whose variable region has a higher 
specific affinity for a particular antigen, or whose constant region 
can elicit enhanced complement fixation or to make other 
improvements in properties possessed by a particular constant region.


"Altered antibodies" means antibodies wherein the amino acid
sequence has been varied from that of a mammalian or other
vertebrate antibody. Because of the relevance of recombinant DNA
techniques to this invention, one need not be confined to the
sequences of amino acids found in natural antibodies; antibodies can
be redesigned to obtain desired characteristics. The possible
variations are many and range from the changing of just one or a few
amino acids to the complete redesign of, for example, the constant
region. Changes in the constant region will, in general, be made in
order to improve the cellular process characteristics, such as
complement fixation, interaction with membranes, and other effector
functions. Changes in the variable region will be made in order to
improve the antigen binding characteristics. The antibody can also
be engineered so as to aid the specific delivery of a toxic agent
according to the "magic bullet" concept. Alterations can be made
by standard recombinant techniques and also by oligonucleotide-directed
mutagenesis techniques (Dalbadie-McFarland, et al., Proc.
Natl. Acad. Sci. (USA), 79: 6409 (1982)).

"Univalent antibodies" refers to aggregations which comprise a
heavy chain/light chain dimer bound to the Fc (or stem) region of a
second heavy chain. Such antibodies are specific for antigen, but
have the additional desirable property of targeting tissues with
specific antigenic surfaces, without causing its antigenic
effectiveness to be impaired, i.e., there is no antigenic
modulation. This phenomenon and the property of univalent
antibodies in this regard is set forth in Glennie, M.J., et al.,
Nature, 295: 712 (1982). Univalent antibodies have heretofore been
formed by proteolysis.

"Fab" region refers to those portions of the chains which are
roughly equivalent, or analogous, to the sequences which comprise
the Y branch portions of the heavy chain and to the light chain in
its entirety, and which collectively (in aggregates) have been shown
to exhibit antibody activity. "Fab protein", which protein is one
of the aspects of the invention, includes aggregates of one heavy
and one light chain (commonly known as Fab'), as well as tetramers
which correspond to the two branch segments of the antibody Y
(commonly known as F(ab')2), whether any of the above are
covalently or non-covalently aggregated, so long as the aggregation
is capable of selectively reacting with a particular antigen or
antigen family. Fab antibodies have, as have univalent ones, been
formed heretofore by proteolysis, and share the property of not
eliciting antigen modulation on target tissues. However, as they
lack the "effector" Fc portion they cannot effect, for example,
lysis of the target cell by macrophages.

"Fab protein" has similar subsets according to the definition of 
the present invention as does the general term "antibodies" or
"immunoglobulins". Thus, "mammalian" Fab protein, "hybrid" Fab
protein, "chimeric" Fab protein and "altered" Fab protein are defined
analogously to the corresponding definitions set forth in the 
previous paragraphs for the various types of antibodies. 

Individual heavy or light chains may of course be "mammalian", 
"chimeric" or "altered" in accordance with the above. As will 
become apparent from the detailed description of the invention, it
is possible, using the techniques disclosed, to prepare other
combinations of the four-peptide chain aggregates, besides those 
specifically defined, such as hybrid antibodies containing chimeric 
light and mammalian heavy chains, hybrid Fab proteins containing 
chimeric Fab proteins of heavy chains associated with mammalian 
light chains, and so forth. 

"Expression vector" includes vectors which are capable of 
expressing DNA sequences contained therein, i.e., the coding 
sequences are operably linked to other sequences capable of 
effecting their expression. It is implied, although not always 
explicitly stated, that these expression vectors must be replicable 
in the host organisms either as episomes or as an integral part of 
the chromosomal DNA. Clearly a lack of replicability would render
them effectively inoperable. A useful, but not a necessary,
element of an effective expression vector is a marker encoding
sequence, i.e., a sequence encoding a protein which results in a
phenotypic property (e.g., tetracycline resistance) of the cells
containing the protein which permits those cells to be readily
identified. In sum, "expression vector" is given a functional
definition, and any DNA sequence which is capable of effecting
expression of a specified contained DNA code is included in this
term, as it is applied to the specified sequence. As at present
such vectors are frequently in the form of plasmids, "plasmid"
and "expression vector" are often used interchangeably. However,
the invention is intended to include such other forms of expression
vectors which serve equivalent functions and which may, from time to
time, become known in the art.

"Recombinant host cells" refers to cells which have been 
transformed with vectors constructed using recombinant DNA 
techniques. As defined herein, the antibody or modification thereof
produced by a recombinant host cell is by virtue of this
transformation, rather than in such lesser amounts, or more
commonly, in such less than detectable amounts, as would be produced
by the untransformed host.

In descriptions of processes for isolation of antibodies from
recombinant hosts, the terms "cell" and "cell culture" are used
interchangeably to denote the source of antibody unless it is
clearly specified otherwise. In other words, recovery of antibody
from the "cells" may mean either from spun down whole cells, or from
the cell culture containing both the medium and the suspended cells.


B. Host Cell Cultures and Vectors

The vectors and methods disclosed herein are suitable for use in
host cells over a wide range of prokaryotic and eukaryotic organisms.

In general, of course, prokaryotes are preferred for cloning of
DNA sequences in constructing the vectors useful in the invention.
For example, E. coli K12 strain 294 (ATCC No. 31446) is particularly
useful. Other microbial strains which may be used include E. coli
strains such as E. coli B, and E. coli X1776 (ATCC No. 31537).
These examples are, of course, intended to be illustrative rather
than limiting.

Prokaryotes may also be used for expression. The aforementioned
strains, as well as E. coli W3110 (F-, λ-, prototrophic, ATCC
No. 27325), bacilli such as Bacillus subtilis, and other
enterobacteriaceae such as Salmonella typhimurium or Serratia
marcescens, and various Pseudomonas species may be used.

In general, plasmid vectors containing replicon and control
sequences which are derived from species compatible with the host
cell are used in connection with these hosts. The vector ordinarily
carries a replication site, as well as marking sequences which are
capable of providing phenotypic selection in transformed cells. For
example, E. coli is typically transformed using pBR322, a plasmid
derived from an E. coli species (Bolivar, et al., Gene, 2: 95
(1977)). pBR322 contains genes for ampicillin and tetracycline
resistance and thus provides easy means for identifying transformed
cells. The pBR322 plasmid, or other microbial plasmid, must also
contain, or be modified to contain, promoters which can be used by
the microbial organism for expression of its own proteins. Those
promoters most commonly used in recombinant DNA construction include
the β-lactamase (penicillinase) and lactose promoter systems (Chang,
et al., Nature, 275: 615 (1978); Itakura, et al., Science, 198: 1056
(1977); Goeddel, et al., Nature, 281: 544 (1979)) and a tryptophan
(trp) promoter system (Goeddel, et al., Nucleic Acids Res., 8: 4057
(1980); EPO Appl. Publ. No. 0036776). While these are the most
commonly used, other microbial promoters have been discovered and
utilized, and details concerning their nucleotide sequences have
been published, enabling a skilled worker to ligate them
functionally with plasmid vectors (Siebenlist, et al., Cell, 20: 269
(1980)).

In addition to prokaryotes, eukaryotic microbes, such as yeast
cultures, may also be used. Saccharomyces cerevisiae, or common
baker's yeast, is the most commonly used among eukaryotic
microorganisms, although a number of other strains are commonly
available. For expression in Saccharomyces, the plasmid YRp7, for
example, (Stinchcomb, et al., Nature, 282: 39 (1979); Kingsman, et al.,
Gene, 7: 141 (1979); Tschemper, et al., Gene, 10: 157 (1980)) is
commonly used. This plasmid already contains the trp1 gene which
provides a selection marker for a mutant strain of yeast lacking the
ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1
(Jones, Genetics, 85: 12 (1977)). The presence of the trp1 lesion
as a characteristic of the yeast host cell genome then provides an
effective environment for detecting transformation by growth in the
absence of tryptophan.

Suitable promoting sequences in yeast vectors include the
promoters for 3-phosphoglycerate kinase (Hitzeman, et al., J. Biol.
Chem., 255: 2073 (1980)) or other glycolytic enzymes (Hess, et al.,
J. Adv. Enzyme Reg., 7: 149 (1968); Holland, et al., Biochemistry,
17: 4900 (1978)), such as enolase, glyceraldehyde-3-phosphate
dehydrogenase, hexokinase, pyruvate decarboxylase,
phosphofructokinase, glucose-6-phosphate isomerase,
3-phosphoglycerate mutase, pyruvate kinase, triosephosphate
isomerase, phosphoglucose isomerase, and glucokinase. In
constructing suitable expression plasmids, the termination sequences
associated with these genes are also ligated into the expression
vector 3' of the sequence desired to be expressed to provide
polyadenylation of the mRNA and termination. Other promoters, which
have the additional advantage of transcription controlled by growth
conditions, are the promoter regions for alcohol dehydrogenase 2,
isocytochrome C, acid phosphatase, degradative enzymes associated
with nitrogen metabolism, and the aforementioned glyceraldehyde-3-
phosphate dehydrogenase, and enzymes responsible for maltose and
galactose utilization (Holland, ibid.). Any plasmid vector
containing yeast-compatible promoter, origin of replication and
termination sequences is suitable.

In addition to microorganisms, cultures of cells derived from
multicellular organisms may also be used as hosts. In principle,
any such cell culture is workable, whether from vertebrate or
invertebrate culture. However, interest has been greatest in
vertebrate cells, and propagation of vertebrate cells in culture
(tissue culture) has become a routine procedure in recent years
(Tissue Culture, Academic Press, Kruse and Patterson, editors
(1973)). Examples of such useful host cell lines are VERO and HeLa
cells, Chinese hamster ovary (CHO) cell lines, and WI38, BHK, COS-7
and MDCK cell lines. Expression vectors for such cells ordinarily
include (if necessary) an origin of replication, a promoter located
in front of the gene to be expressed, along with any necessary
ribosome binding sites, RNA splice sites, polyadenylation site, and
transcriptional terminator sequences.

For use in mammalian cells, the control functions on the
expression vectors are often provided by viral material. For
example, commonly used promoters are derived from polyoma,
Adenovirus 2, and most frequently Simian Virus 40 (SV40). The early
and late promoters of SV40 virus are particularly useful because
both are obtained easily from the virus as a fragment which also
contains the SV40 viral origin of replication (Fiers, et al., Nature,
273: 113 (1978)) incorporated herein by reference. Smaller or
larger SV40 fragments may also be used, provided there is included
the approximately 250 bp sequence extending from the Hind III site
toward the Bgl I site located in the viral origin of replication.
Further, it is also possible, and often desirable, to utilize
promoter or control sequences normally associated with the desired
gene sequence, provided such control sequences are compatible with
the host cell systems.

An origin of replication may be provided either by construction
of the vector to include an exogenous origin, such as may be derived
from SV40 or other viral (e.g. Polyoma, Adeno, VSV, BPV, etc.)
source, or may be provided by the host cell chromosomal replication
mechanism. If the vector is integrated into the host cell
chromosome, the latter is often sufficient.

It will be understood that this invention, although described
herein in terms of a preferred embodiment, should not be construed
as limited to those host cells, vectors and expression systems
exemplified.

C. Methods Employed

C.1 Transformation:

If cells without formidable cell wall barriers are used as host
cells, transfection is carried out by the calcium phosphate
precipitation method as described by Graham and Van der Eb,
Virology, 62: 546 (1978). However, other methods for introducing
DNA into cells, such as by nuclear injection or by protoplast fusion,
may also be used.

If prokaryotic cells or cells which contain substantial cell
wall constructions are used, the preferred method of transfection is
calcium treatment using calcium chloride as described by Cohen, F.N.,
et al., Proc. Natl. Acad. Sci. (USA), 69: 2110 (1972).

C.2 Vector Construction

Construction of suitable vectors containing the desired coding
and control sequences employs standard ligation techniques. Isolated
plasmids or DNA fragments are cleaved, tailored, and religated in
the form desired to form the plasmids required. The methods
employed are not dependent on the DNA source, or intended host.

Cleavage is performed by treating with restriction enzyme (or
enzymes) in suitable buffer. In general, about 1 ug plasmid or DNA
fragments is used with about 1 unit of enzyme in about 20 ul of
buffer solution. (Appropriate buffers and substrate amounts for
particular restriction enzymes are specified by the manufacturer.)
Incubation times of about 1 hour at 37°C are workable. After
incubations, protein is removed by extraction with phenol and
chloroform, and the nucleic acid is recovered from the aqueous
fraction by precipitation with ethanol.
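
The approximate proportions given for cleavage can be scaled to other
amounts of DNA. The following is a minimal sketch only, not part of
the described method: the function name digest_recipe and its
parameters are hypothetical, the buffer volume is read as microliters,
and linear scaling of enzyme and volume with DNA mass is an assumption
(the manufacturer's recommendations govern in practice).

```python
def digest_recipe(dna_ug, units_per_ug=1.0, volume_ul_per_ug=20.0):
    """Scale the approximate digest proportions to a given DNA mass.

    Defaults follow the rough figures in the text (about 1 unit of
    enzyme per ug of DNA); the volume unit (ul) and the linear scaling
    are assumptions made for illustration.
    """
    if dna_ug <= 0:
        raise ValueError("dna_ug must be positive")
    return {
        "dna_ug": dna_ug,
        "enzyme_units": dna_ug * units_per_ug,
        "buffer_volume_ul": dna_ug * volume_ul_per_ug,
        "incubation": "about 1 hour at 37 C",
    }


if __name__ == "__main__":
    # Example: proportions for 5 ug of plasmid.
    for key, value in digest_recipe(5).items():
        print(key, value)
```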

If blunt ends are required, the preparation is treated for 15
minutes at 15°C with 10 units of E. coli DNA Polymerase I (Klenow),
phenol-chloroform extracted, and ethanol precipitated.

Size separation of the cleaved fragments is performed using 6
percent polyacrylamide gel described by Goeddel, D., et al., Nucleic
Acids Res., 8: 4057 (1980) incorporated herein by reference.

For ligation, approximately equimolar amounts of the desired
components, suitably end tailored to provide correct matching, are
treated with about 10 units T4 DNA ligase per 0.5 ug DNA. (When
cleaved vectors are used as components, it may be useful to prevent
religation of the cleaved vector by pretreatment with bacterial
alkaline phosphatase.)
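
As an illustration of "approximately equimolar amounts", the mass of
one double-stranded component needed to match another scales with the
ratio of their lengths, since the average mass per base pair cancels.
The sketch below is illustrative only; the function names are
hypothetical, the 4361 bp length of pBR322 is the commonly published
figure, and the 1000 bp insert is an arbitrary example value.

```python
PBR322_BP = 4361  # commonly published length of pBR322, in base pairs


def equimolar_insert_ng(vector_ng, vector_bp, insert_bp):
    """Mass of insert (ng) giving the same number of moles as the vector.

    For double-stranded DNA, moles are proportional to mass / length,
    so the equimolar mass is the vector mass times the length ratio.
    """
    if vector_bp <= 0 or insert_bp <= 0:
        raise ValueError("lengths must be positive")
    return vector_ng * insert_bp / vector_bp


def ligase_units(total_dna_ug, units_per_half_ug=10.0):
    """Units of T4 DNA ligase at about 10 units per 0.5 ug of DNA."""
    return units_per_half_ug * total_dna_ug / 0.5


if __name__ == "__main__":
    insert_ng = equimolar_insert_ng(200.0, PBR322_BP, 1000)
    total_ug = (200.0 + insert_ng) / 1000.0
    print(round(insert_ng, 1), "ng insert;", round(ligase_units(total_ug), 1), "units ligase")
```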

In the examples described below, correct ligations for plasmid
construction are confirmed by transforming E. coli K12 strain 294
(ATCC 31446) with the ligation mixture. Successful transformants
were selected by ampicillin or tetracycline resistance depending on
the mode of plasmid construction. Plasmids from the transformants
were then prepared, analyzed by restriction and/or sequenced by the
method of Messing, et al., Nucleic Acids Res., 9: 309 (1981) or by the
method of Maxam, et al., Methods in Enzymology, 65: 499 (1980).

D. Outline of Procedures 

D.1 Mammalian Antibodies

The first type of antibody which forms a part of this invention,
and is prepared by the methods thereof, is "mammalian antibody", one
wherein the heavy and light chains mimic the amino acid sequences of
an antibody otherwise produced by a mature mammalian B lymphocyte
either in situ or when fused with an immortalized cell as part of a
hybridoma culture. In outline, these antibodies are produced as
follows:

Messenger RNA coding for heavy or light chain is isolated from a
suitable source, either mature B cells or a hybridoma culture,
employing standard techniques of RNA isolation, and the use of
oligo-dT cellulose chromatography to segregate the poly-A mRNA.
The poly-A mRNA may, further, be fractionated to obtain sequences of
sufficient size to code for the amino acid sequences in the light or
heavy chain of the desired antibody as the case may be.


A cDNA library is then prepared from the mixture of mRNA using a
suitable primer, preferably a nucleic acid sequence which is
characteristic of the desired cDNA. Such a primer may be
hypothesized and synthesized based on the amino acid sequence of the
antibody if the sequence is known. In the alternative, cDNA may be
prepared from unfractionated poly-A mRNA from a cell line producing
the desired antibody, or poly-dT may be used as primer. The
resulting cDNA is optionally size fractionated on polyacrylamide gel
and then extended with, for example, dC residues for annealing with
pBR322 or other suitable cloning vector which has been cleaved by a
suitable restriction enzyme, such as Pst I, and extended with dG
residues. Alternative means of forming cloning vectors containing
the cDNA using other tails and other cloning vector remainder may,
of course, also be used but the foregoing is a standard and
preferable choice. A suitable host cell strain, typically E. coli,
is transformed with the annealed cloning vectors, and the successful
transformants identified by means of, for example, tetracycline
resistance or other phenotypic characteristic residing on the
cloning vector plasmid.

Successful transformants are picked and transferred to
microtiter dishes or other support for further growth and
preservation. Nitrocellulose filter imprints of these growing
cultures are then probed with suitable nucleotide sequences
containing bases known to be complementary to desired sequences in
the cDNA. Several types of probe may be used, preferably synthetic
single stranded DNA sequences labeled by kinasing with 32P-labeled
ATP. The cells fixed to the nitrocellulose filter are lysed, the DNA
denatured, and then fixed before reaction with kinased probe.
Clones which successfully hybridize are detected by contact with a
photoplate, then plasmids from the growing colonies are isolated and
sequenced by means known in the art to verify that the desired
portions of the gene are present.

The desired gene fragments are excised and tailored to assure
appropriate reading frame with the control segments when inserted
into suitable expression vectors. Typically, nucleotides are added
to the 5' end to include a start signal and a suitably positioned
restriction endonuclease site.

The tailored gene sequence is then positioned in a vector which
contains a promoter in reading frame with the gene and compatible
with the proposed host cell. A number of plasmids such as those
described in U.S. Pat. Appln. Ser. Nos. 307473; 291892; and 305657
(EPO Publ. Nos. 0036776; 0048970 and 0051873) have been described
which already contain the appropriate promoters, control sequences,
ribosome binding sites, and transcription termination sites, as well
as convenient markers.


In the present invention, the gene coding for the light chain
and that coding for the heavy chain are recovered separately by the
procedures outlined above. Thus they may be inserted into separate
expression plasmids, or together in the same plasmid, so long as
each is under suitable promoter and translation control.

The expression vectors constructed above are then used to
transform suitable cells. The light and heavy chains may be
transformed into separate cell cultures, either of the same or of
differing species; separate plasmids for light and heavy chain may
be used to co-transform a single cell culture; or, finally, a single
expression plasmid containing both genes and capable of expressing
the genes for both light and heavy chain may be transformed into a
single cell culture.

Regardless of which of the three foregoing options is chosen,
the cells are grown under conditions appropriate to the production
of the desired protein. Such conditions are primarily mandated by
the type of promoter and control systems used in the expression
vector, rather than by the nature of the desired protein. The
protein thus produced is then recovered from the cell culture by
methods known in the art, the choice of which is necessarily
dependent on the form in which the protein is expressed. For
example, it is common for mature heterologous proteins expressed in
E. coli to be deposited within the cells as insoluble particles
which require cell lysis and solubilization in denaturant to permit
recovery. On the other hand, proteins under proper synthesis
circumstances, in yeast and bacterial strains, can be secreted into
the medium (yeast and gram positive bacteria) or into the
periplasmic space (gram negative bacteria), allowing recovery by less
drastic procedures. Tissue culture cells as hosts also appear, in
general, to permit reasonably facile recovery of heterologous
proteins.

When heavy and light chain are coexpressed in the same host, the
isolation procedure is designed so as to recover reconstituted
antibody. This can be accomplished in vitro as described below, or
might be possible in vivo in a microorganism which secretes the IgG
chains out of the reducing environment of the cytoplasm. A more
detailed description is given in D.2, below.

D.2 Chain Recombination Techniques

The ability of the method of the invention to produce heavy and
light chains or portions thereof, in isolation from each other,
offers the opportunity to obtain unique and unprecedented assemblies
of immunoglobulins, Fab regions, and univalent antibodies. Such
preparations require the use of techniques to reassemble isolated
chains. Such means are known in the art, and it is, thus,
appropriate to review them here.

While single chain disulfide bond containing proteins have been
reduced and reoxidized to regenerate in high yield native structure
and activity (Freedman, R.B., et al., In Enzymology of Post
Translational Modification of Proteins, I: 157-212 (1980) Academic
Press, NY.), proteins which consist of discontinuous polypeptide
chains held together by disulfide bonds are more difficult to
reconstruct in vitro after reductive cleavage. Insulin, a cameo
case, has received much experimental attention over the years, and
can now be reconstructed so efficiently that an industrial process
has been built around it (Chance, R.E., et al., In Peptides:
Proceedings of the Seventh Annual American Peptide Symposium (Rich,
D.H. and Gross, E., eds.) 721-728, Pierce Chemical Co., Rockford,
IL. (1981)).

Immunoglobulin has proved a more difficult problem than
insulin. The tetramer is stabilized intra and intermolecularly by
15 or more disulfide bonds. It has been possible to recombine heavy
and light chains, disrupted by cleavage of only the interchain
disulfides, to regain antibody activity even without restoration of
the inter-chain disulfides (Edelman, G.M., et al., Proc. Natl. Acad.
Sci. (USA) 50: 753 (1963)). In addition, active fragments of IgG
formed by proteolysis (Fab fragments of ~50,000 MW) can be split
into their fully reduced heavy chain and light chain components and
fairly efficiently reconstructed to give active antibody (Haber, E.,
Proc. Natl. Acad. Sci. (USA) 52: 1099 (1964); Whitney, P.L.,
et al., Proc. Natl. Acad. Sci. (USA) 53: 524 (1965)). Attempts to
reconstitute active antibody from fully reduced native IgG have been
largely unsuccessful, presumably due to insolubility of the reduced
chains and of side products or intermediates in the refolding
pathway (see discussion in Freedman, M.H., et al., J. Biol. Chem.
241: 5225 (1966)). If, however, the immunoglobulin is randomly
modified by polyalanylation of its lysines before complete
reduction, the separated chains have the ability to recover
antigen-combining activity upon reoxidation (ibid).

A particularly suitable method for immunoglobulin reconstitution
is derivable from the now classical insulin recombination studies,
wherein starting material was prepared by oxidative sulfitolysis,
thus generating thiol-labile S-sulfonate groups at all cysteines in
the protein, non-reductively breaking disulfides (Chance et al.
(supra)). Oxidative sulfitolysis is a mild disulfide cleavage
reaction (Means, G.E., et al., Chemical Modification of Proteins,
Holden-Day, San Francisco (1971)) which is sometimes more gentle
than reduction, and which generates derivatives which are stable
until exposed to mild reducing agent, at which time disulfide
reformation can occur via thiol-disulfide interchange. In the
present invention the heavy and light chain S-sulfonates generated
by oxidative sulfitolysis were reconstituted utilizing both air
oxidation and thiol-disulfide interchange to drive disulfide bond
formation. The general procedure is set forth in detail in U.S.
Serial No. 452,187, filed Dec. 22, 1982 (EPO Appln. No.
83.307840.5), incorporated herein by reference.

D.3 Variants Permitted by Recombinant Technology

The techniques described in paragraphs D.1 and D.2, and the
additional operations which were utilized to gain efficient
production of mammalian antibody, can be varied in quite
straightforward and simple ways to produce a great variety of
modifications of this basic antibody form. These variations are
inherent in the use of recombinant technology, which permits
modification at a genetic level of amino acid sequences in normally
encountered mammalian immunoglobulin chains, and the great power of
this approach lies in its ability to achieve these variations, as
well as in its potential for economic and specific production of
desired scarce, and often contaminated, molecules. The variations
also inhere in the ability to isolate production of individual
chains, and thus create novel assemblies.

Briefly, since genetic manipulations permit reconstruction of
genomic material in the process of construction of expression
vectors, such reconstruction can be manipulated to produce new
coding sequences for the components of "natural" antibodies or
immunoglobulins. As discussed in further detail below, the coding
sequence for a mammalian heavy chain need not be derived entirely
from a single source or single species, but portions of a sequence
can be recovered by the techniques described in D.1 from differing
pools of mRNA, such as murine-murine hybridomas, human-murine
hybridomas, or B cells differentiated in response to a series of
antigen challenges. The desired portions of the sequences in each
case can be recovered using the probe and analysis techniques
described in D.1, and recombined in an expression vector using the
same ligation procedures as would be employed for portions of the
same model sequence. Such chimeric chains can be constructed of any
desired length; hence, for example, a complete heavy chain can be
constructed, or only sequence for the Fab region thereof.

The additional area of flexibility which arises from the use of
recombinant techniques results from the power to produce heavy and
light chains or fragments thereof in separate cultures or of unique
combinations of heavy and light chain in the same culture, and to
prevent reconstitution of the antibody or immunoglobulin aggregation
until the suitable components are assembled. Thus, while normal
antibody production results automatically in the formation of
"mammalian antibodies" because the light and heavy chain portions
are constructed in response to a particular determinant in the same
cell, the methods of the present invention present the opportunity
to assemble entirely new mixtures. Somewhat limited quantities of
"hybrid" antibodies have been produced by "quadromas", i.e., fusions
of two hybridoma cell cultures which permit random assemblies of the
heavy and light chains so produced.

The present invention permits a more controlled assembly of
desired chains, either by mixing the desired chains in vitro, or by
transforming the same culture with the coding sequences for the
desired chains.

D.4 Composite Immunoglobulins

The foregoing procedure, which describes in detail the
recombinant production of mammalian antibodies, is employed with some
modifications to construct the remaining types of antibodies or NSIs
encompassed by the present invention. To prepare the particular
embodiment of composite non-specific immunoglobulin wherein the
homology of the chains corresponds to the sequences of
immunoglobulins of different specificities, it is, of course, only
necessary to prepare the heavy and light chains in separate cultures
and reassemble them as desired.

For example, in order to make an anti-CEA light chain/anti-hepatitis
heavy chain composite antibody, a suitable source for the
mRNA used as a template for the light chain clone would comprise,
for instance, the anti-CEA producing cell line of paragraph E.1.
The mRNA corresponding to heavy chain would be derived from B cells
raised in response to hepatitis infection or from hybridoma in which
the B cell was of this origin. It is clear that such composites can
be assembled using the methods of the invention almost at will, and
are limited only by available sources of mRNA suitable for use as
templates for the respective chains. All other features of the
process are similar to those described above.

D.5 Hybrid Antibodies

Hybrid antibodies are particularly useful as they are capable of
simultaneous reaction with more than one antigen. Pairs of heavy
and light chains corresponding to chains of antibodies for different
antigens, such as those set forth in paragraph D.4, are prepared in
four separate cultures, thus preventing premature assembly of the
tetramer. Subsequent mixing of the four separately prepared
peptides then permits assembly into the desired tetramers. While
random aggregation may lead to the formation of considerable
undesired product, that portion of the product in which homologous
light and heavy chains are bound to each other and mismatched to
another pair gives the desired hybrid antibody.
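
The remark about undesired product from random aggregation can be
made concrete with a small enumeration. The sketch below is purely
illustrative and rests on a simplifying assumption that is not taken
from the text: every heavy chain pairs with every light chain with
equal probability, and any two heavy/light pairs join into a tetramer
with equal probability. The chain labels and function names are
hypothetical. Under that assumption the desired hybrid is the tetramer
made of the two matched pairs of differing specificity.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def tetramer_distribution(heavy=("H_a", "H_b"), light=("L_a", "L_b")):
    """Probability of each tetramer under purely random assembly.

    A tetramer is modeled as an unordered pair of heavy/light
    half-molecules; all pairings are assumed equally likely.
    """
    halves = list(product(heavy, light))  # every heavy/light pairing
    p_half = Fraction(1, len(halves))
    dist = Counter()
    for first, second in product(halves, repeat=2):
        key = tuple(sorted((first, second)))  # order of halves is irrelevant
        dist[key] += p_half * p_half
    return dist


def desired_hybrid_fraction(dist):
    """Fraction that is (H_a with L_a) joined to (H_b with L_b)."""
    target = tuple(sorted((("H_a", "L_a"), ("H_b", "L_b"))))
    return dist[target]


if __name__ == "__main__":
    distribution = tetramer_distribution()
    print("distinct tetramers:", len(distribution))
    print("desired hybrid fraction:", desired_hybrid_fraction(distribution))
```

Real chain association is not expected to be uniformly random, so the
figure obtained from this model is only a baseline for comparison.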

D.6 Chimeric Antibodies

For construction of chimeric antibodies (wherein, for example,
the variable sequences are separately derived from the constant
sequences) the procedures of paragraph D.1 and D.2 are again
applicable with appropriate additions and modifications. A
preferred procedure is to recover desired portions of the genes
coding for parts of the heavy and light chains from suitable,
differing, sources and then to religate these fragments using
restriction endonucleases to reconstruct the gene coding for each
chain.

For example, in a particularly preferred chimeric construction,
portions of the heavy chain gene and of the light chain gene which
encode the variable sequences of antibodies produced by a murine
hybridoma culture are recovered and cloned from this culture, and
gene fragments encoding the constant regions of the heavy and light
chains for human antibodies are recovered and cloned from, for
example, human myeloma cells. Suitable restriction enzymes may then
be used to ligate the variable portions of the mouse gene to the
constant regions of the human gene for each of the two chains. The
chimeric chains are produced as set forth in D.1, aggregated as set
forth in D.2 and used in the same manner as the non-chimeric forms.
Of course, any splice point in the chains can be chosen.

D.7 Altered Antibodies

Altered antibodies present, in essence, an extension of chimeric
ones. Again, the techniques of D.1 and D.2 are applicable; however,
rather than splicing portions of the chain(s), suitable amino acid
alterations, deletions or additions are made using available
techniques such as mutagenesis (supra). For example, genes which
encode antibodies having diminished complement fixation properties,
or which have enhanced metal binding capacities, are prepared using
such techniques. The latter type may, for example, take advantage
of the known gene sequence encoding metallothionein II (Karin, K.,
et al., Nature, 299: 797 (1982)). The chelating properties of this
molecular fragment are useful in carrying heavy metals to tumor
sites as an aid in tumor imaging (Scheinberg, D.A., et al., Science,
215: 19 (1982)).


D.8 Univalent Antibodies

In another preferred embodiment, antibodies are formed which
comprise one heavy and light chain pair coupled with the Fc region
of a third (heavy) chain. These antibodies have a particularly
useful property. They can, like ordinary antibodies, be used to
target antigenic surfaces of tissues, such as tumors, but, unlike
ordinary antibodies, they do not cause the antigenic surfaces of the
target tissue to retreat and become non-receptive. Ordinary
antibody use results in aggregation and subsequent inactivation, for
several hours, of such surface antigens.

The method of construction of univalent antibodies is a
straightforward application of the invention. The gene for heavy
chain of the desired Fc region is cleaved by restriction enzymes,
and only that portion coding for the desired Fc region expressed.
This portion is then bound using the technique of D.2 to separately
produced heavy chain, the desired pairs separated from heavy/heavy
and Fc/Fc combinations, and separately produced light chain added.
Pre-binding of the two heavy chain portions thus diminishes the
probability of formation of ordinary antibody.

D.9 Fab Protein

Similarly, it is not necessary to include the entire gene for 
the heavy chain portion. All of the aforementioned variations can 
be superimposed on a procedure for Fab protein production and the 
overall procedure differs only in that that portion of the heavy 
chain coding for the amino terminal 220 amino acids is employed in 
the appropriate expression vector. 

E. Specific Examples of Preferred Embodiments

The invention has been described above in general terms and
there follow several specific examples of embodiments which set
forth details of experimental procedure in producing the desired
antibodies. Example E.1 sets forth the general procedure for
preparing anti-CEA antibody components, i.e. for a "mammalian
antibody". Example E.3 sets forth the procedure for reconstitution
and thus is applicable to preparation of mammalian, composite,
hybrid and chimeric immunoglobulins, and Fab proteins and univalent
antibodies. Example E.4 sets forth the procedure for tailoring the
heavy or light chain so that the variable and constant regions may
be derived from different sources. Example E.5 sets forth the
method of obtaining a shortened heavy chain genome which permits the
production of the Fab regions and, in an analogous manner, Fc region.

The examples set forth below are included for illustrative
purposes and do not limit the scope of the invention.

E.1 Construction of Expression Vectors for Murine anti-CEA
Antibody Chains and Peptide Synthesis

Carcinoembryonic antigen (CEA) is associated with the surface of
certain tumor cells of human origin (Gold, P., et al., J. Exp. Med.,
122: 467 (1965)). Antibodies which bind to CEA (anti-CEA
antibodies) are useful in early detection of these tumors (Van Nagell,
T.R., et al., Cancer Res. 40: 502 (1980)), and have the potential
for use in treatment of those human tumors which appear to support
CEA at their surfaces. A mouse hybridoma cell line which secretes
anti-CEA antibodies of the Igγ1 class, CEA.66-E3, has been prepared
as described by Wagener, C. et al., J. Immunol., 130: 2308 (1983) which
is incorporated herein by reference, and was used as mRNA source. The
production of anti-CEA antibodies by this cell line was determined.

The N-terminal sequences of the antibodies produced by these cells
were compared with those of monoclonal anti-CEA as follows. Purified
IgG was treated with PCAse (Podell, D.N., et al., Biochem. Biophys.
Res. Commun. 81: 176 (1978)), and then dissociated in 6M guanidine
hydrochloride, 10 mM 2-mercaptoethanol (1.0 mg of immunoglobulin, 5
min, 100°C water bath). The dissociated chains were separated on a
Waters Associates alkyl phenyl column using a linear gradient from
100 percent A (0.1 percent TFA-water) to 90 percent B (TFA/H2O/MeCN
0.1/9.9/90) at a flow rate of 0.8 ml/min. Three major peaks were
eluted and analyzed on SDS gels by silver staining. The first two
peaks were pure light chain (MW 25,000 daltons); the third peak
showed a (7:3) mixture of heavy and light chain. 1.2 nmoles of light
chain were sequenced by the method of Shively, J.E., Methods in
Enzymology, 79: 31 (1981), with an NH2-terminal yield of 0.4
nmoles. A mixture of heavy and light chains (3 nmoles) was also
sequenced, and sequence of light chain was deducted from the double
sequence to yield the sequence of the heavy chain.

In the description which follows, isolation and expression of the
genes for the heavy and light chains for anti-CEA antibody produced
by CEA.66-E3 are described. As the constant regions of these chains
belong to the gamma and kappa families, respectively, "light chain"
and "kappa chain", and "heavy chain" and "gamma chain",
respectively, are used interchangeably below.

E.1.1 Isolation of Messenger RNA for Anti CEA Light and Heavy 
(Kappa and Gamma) Chains 

Total RNA from CEA.66-E3 cells was extracted essentially as
reported by Lynch et al., Virology, 98: 251 (1979). Cells were
pelleted by centrifugation and approximately 1 g portions of pellet
resuspended in 10 ml of 10 mM NaCl, 10 mM Tris HCl (pH 7.4), 1.5 mM
MgCl2. The resuspended cells were lysed by addition of non-ionic
detergent NP-40 to a final concentration of 1 percent, and nuclei
removed by centrifugation. After addition of SDS (pH 7.4) to 1
percent final concentration, the supernatant was extracted twice
with 3 ml portions of phenol (redistilled)/chloroform:isoamyl
alcohol 25:1 at 4°C. The aqueous phase was made 0.2 M in NaCl and
total RNA was precipitated by addition of two volumes of 100 percent
ethanol and overnight storage at -20°C. After centrifugation, polyA
mRNA was purified from total RNA by oligo-dT cellulose
chromatography as described by Aviv and Leder, Proc. Natl. Acad.
Sci. (USA), 69: 1408 (1972). 142 ug of polyA mRNA was obtained
from 1 g cells.

E.1.2 Preparation of E. coli Colony Library Containing
Plasmids with Heavy and Light DNA Sequence Inserts

5 ug of the unfractionated polyA mRNA prepared in paragraph
E.1.1 was used as template for oligo-dT primed preparation of
double-stranded (ds) cDNA by standard procedures as described by
Goeddel et al., Nature 281: 544 (1979) and Wickens et al., J. Biol.
Chem. 253: 2483 (1978) incorporated herein by reference. The cDNA
was size fractionated by 6 percent polyacrylamide gel
electrophoresis and 124 ng of ds cDNA greater than 600 base pairs in
length was recovered by electroelution. A 20 ng portion of ds cDNA
was extended with deoxy C residues using terminal deoxynucleotidyl
transferase as described in Chang et al., Nature 275: 617 (1978)
incorporated herein by reference, and annealed with 200 ng of the
plasmid pBR322 (Bolivar et al., Gene 2: 95 (1977)) which had been
cleaved with Pst I and tailed with deoxy G. Each annealed mixture
was then transformed into E. coli K12 strain 294 (ATCC No. 31446).
Approximately 8500 ampicillin sensitive, tetracycline resistant
transformants were obtained.

E.1.3 Preparation of Synthetic Probes 

The 14mer, 5' GGTGGGAAGATGGA 3', complementary to the coding
sequence of constant region for mouse MOPC21 kappa chain which
begins 25 basepairs 3' of the variable region DNA sequence, was used
as kappa chain probe. A 15mer, 5' GACCAGGCATCCCAG 3',
complementary to a coding sequence located 72 basepairs 3' of the
variable region DNA sequence for mouse MOPC21 gamma chain, was used
to probe gamma chain gene.
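
Because each probe is complementary to a coding sequence, the stretch of coding strand it is expected to pair with can be written down as its reverse complement. The following is a minimal sketch of that bookkeeping only; the function and variable names are illustrative and are not part of the disclosure.

```python
# Sketch: derive the coding-strand sequence that each synthetic probe
# would pair with. Names here are illustrative assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]

kappa_probe = "GGTGGGAAGATGGA"   # 14mer given in the text
gamma_probe = "GACCAGGCATCCCAG"  # 15mer given in the text

kappa_target = reverse_complement(kappa_probe)
gamma_target = reverse_complement(gamma_probe)

print("kappa probe pairs with 5'", kappa_target, "3'")
print("gamma probe pairs with 5'", gamma_target, "3'")
```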

Both probes were synthesized by the phosphotriester method
described in German Offenlegungsschrift 2644432, incorporated herein
by reference, and made radioactive by kinasing as follows: 250 ng
of deoxyoligonucleotide were combined in 25 µl of 60 mM Tris HCl
(pH 8), 10 mM MgCl2, 15 mM beta-mercaptoethanol, and 100 µCi
[γ-32P] ATP (Amersham, 5000 Ci/mmole). 5 units of T4
polynucleotide kinase were added and the reaction was allowed to
proceed at 37°C for 30 minutes and terminated by addition of EDTA to
20 mM.


E.1.4 Screening of Colony Library for Kappa or Gamma Chain 
Sequences 

~2000 colonies prepared as described in paragraph E.1.2 were
individually inoculated into wells of microtitre dishes containing
LB (Miller, Experiments in Molecular Genetics, p. 431-3, Cold Spring
Harbor Lab., Cold Spring Harbor, New York (1972)) + 5 µg/ml
tetracycline and stored at -20°C after addition of DMSO to 7
percent. Individual colonies from this library were transferred to
duplicate sets of Schleicher and Schuell BA85/20 nitrocellulose
filters and grown on agar plates containing LB + 5 µg/ml
tetracycline. After ~10 hours growth at 37°C the colony filters
were transferred to agar plates containing LB + 5 µg/ml tetracycline
and 12.5 µg/ml chloramphenicol and reincubated overnight at 37°C.

The DNA from each colony was then denatured and fixed to the filter
by a modification of the Grunstein-Hogness procedure as described in
Grunstein et al., Proc. Natl. Acad. Sci. (USA) 72: 3961 (1975),
incorporated herein by reference. Each filter was floated for 3
minutes on 0.5 N NaOH, 1.5 M NaCl to lyse the colonies and denature
the DNA, then neutralized by floating for 15 minutes on 3 M NaCl, 0.5
M Tris HCl (pH 7.5). The filters were then floated for an
additional 15 minutes on 2X SSC, and subsequently baked for 2 hours
in an 80°C vacuum oven. The filters were prehybridized for ~2 hours
at room temperature in 0.9 M NaCl, 1X Denhardts, 100 mM Tris HCl (pH
7.5), 5 mM Na-EDTA, 1 mM ATP, 1 M sodium phosphate (dibasic), 1 mM
sodium pyrophosphate, 0.5 percent NP-40, and 200 µg/ml E. coli
t-RNA, and hybridized in the same solution overnight, essentially as
described by Wallace et al., Nucleic Acids Research 9: 879 (1981),
using ~40x10^6 cpm of either the kinased kappa or gamma probe
described above.

After extensive washing at 37°C in 6X SSC, 0.1 percent SDS, the
filters were exposed to Kodak XR-5 X-ray film with DuPont
Lightning-Plus intensifying screens for 16-24 hours at -80°C.
Approximately 20 colonies which hybridized with kappa chain probe
and 20 which hybridized with gamma chain probe were characterized.

E.1.5 Characterization of Colonies which Hybridize to Kappa 
DNA Sequence Probe 

Plasmid DNAs isolated from several different transformants which
hybridized to kappa chain probe were cleaved with Pst I and
fractionated by polyacrylamide gel electrophoresis (PAGE). This
analysis demonstrated that a number of plasmid DNAs contained cDNA
inserts large enough to encode full length kappa chain. The
complete nucleotide sequence of the cDNA insert of one of these
plasmids was determined by the dideoxynucleotide chain termination
method as described by Smith, Methods Enzymol. 65: 560 (1980),
incorporated herein by reference, after subcloning restriction
endonuclease cleavage fragments into M13 vectors (Messing et al.,
Nucleic Acids Research 9: 309 (1981)). Figure 2 shows the nucleotide
sequence of the cDNA insert of pK17G4 and Figure 3 shows the gene
sequence with the corresponding amino acid sequence. Thus, the
entire coding region of mouse anti-CEA kappa chain was isolated on
this one large DNA fragment. The amino acid sequence of kappa
chain, deduced from the nucleotide sequence of the pK17G4 cDNA
insert, corresponds perfectly with the first 23 N-terminal amino
acids of mature mouse anti-CEA kappa chain as determined by amino
acid sequence analysis of purified mouse anti-CEA kappa chain. The
coding region of pK17G4 contains 27 basepairs or 9 amino acids of
the presequence and 642 basepairs or 214 amino acids of the mature
protein. The mature unglycosylated protein (MW 24,553) has a
variable region of 119 amino acids, including the J1 joining region
of 12 amino acids, and a constant region of 107 amino acids. After
the stop codon behind amino acid 215 begins 212 basepairs of 3'
untranslated sequence up to the polyA addition. The kappa chain
probe used to identify pK17G4 hybridizes to nucleotides 374-388
(Figure 2).
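
The basepair and amino acid counts quoted for the pK17G4 coding region are related by the three-base codon. As a small worked check of that relationship (the helper name is an assumption made for illustration):

```python
# Sketch: convert the coding-region lengths quoted for pK17G4 from
# basepairs to codons (amino acids). Helper name is illustrative.
def codons(base_pairs: int) -> int:
    """Number of whole codons in a coding stretch of the given length."""
    if base_pairs % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return base_pairs // 3

presequence_aa = codons(27)   # presequence portion present in pK17G4
mature_aa = codons(642)       # mature kappa chain

print(presequence_aa, "presequence residues;", mature_aa, "mature residues")
```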


E.1.6 Characterization of Colonies which Hybridize to Gamma 1 
DNA Probe 

Plasmid DNA isolated from several transformants positive for
hybridization with the heavy chain gamma 1 probe was subjected to
Pst I restriction endonuclease analysis as described in E.1.5.
Plasmid DNAs demonstrating the largest cDNA insert fragments were
selected for further study. The nucleotide sequence coding for mouse
heavy (gamma-1) chain shows an Nco I restriction endonuclease
cleavage site near the junction between variable and constant
region. Selected plasmid DNAs were digested with both Pst I and Nco I
and sized on polyacrylamide. This analysis allowed identification
of a number of plasmid DNAs that contain Nco I restriction
endonuclease sites, although none that demonstrate cDNA insert
fragments large enough to encode the entire coding region of mouse
anti-CEA heavy chain.
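
Once an insert has been sequenced, the Pst I / Nco I screening described above can be mirrored in silico by searching for the recognition sequences (CTGCAG for Pst I, CCATGG for Nco I). The sketch below assumes a short made-up sequence purely for illustration; it is not taken from any of the plasmids described.

```python
# Sketch: locate restriction enzyme recognition sequences in a DNA string.
# TOY_INSERT is a hypothetical sequence, not data from the disclosure.
RECOGNITION_SITES = {"Pst I": "CTGCAG", "Nco I": "CCATGG"}

def find_sites(sequence: str, site: str) -> list:
    """Return 0-based start positions of every occurrence of site."""
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions

TOY_INSERT = "AACTGCAGTTCCATGGAACTGCAGTT"

site_map = {name: find_sites(TOY_INSERT, seq)
            for name, seq in RECOGNITION_SITES.items()}
print(site_map)
```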

In one plasmid isolated, pγ298, the cDNA insert of about 1300 bp
contains sequence information for the 5' untranslated region, the
signal sequence and the N-terminal portion of heavy chain. Because
pγ298 did not encode the C-terminal sequence for mouse anti-CEA
gamma 1 chain, plasmid DNA was isolated from other colonies and
screened with Pst I and Nco I. The C-terminal region of the cDNA
insert of pγ11 was sequenced and shown to contain the stop codon, 3'
untranslated sequence and that portion of the coding sequence
missing from pγ298.

Figure 4 presents the entire nucleotide sequence of mouse
anti-CEA heavy chain (as determined by the dideoxynucleotide chain
termination method of Smith, Methods Enzymol., 65: 560 (1980)) and
Figure 5 includes the translated sequence.

The amino acid sequence of gamma 1 (heavy chain) deduced from
the nucleotide sequence of the pγ298 cDNA insert corresponds
perfectly to the first 23 N-terminal amino acids of mature mouse
anti-CEA gamma 1 chain as determined by amino acid sequence analysis
of purified mouse anti-CEA gamma-1 chain. The coding region
consists of 57 basepairs or 19 amino acids of presequences and 1346
basepairs or 447 amino acids of mature protein. The mature
unglycosylated protein (MW 52,258) has a variable region of 135
amino acids, including a D region of 12 amino acids, and a J4
joining region of 13 amino acids. The constant region is 324 amino
acids. After the stop codon behind amino acid 447 begins 96 bp of
3' untranslated sequences up to the polyA addition. The probe used
to identify pγ298 and pγ11 hybridized to nucleotides 528-542 (Figure
4).


E.1.7 Construction of a Plasmid For Direct Expression of Mouse
Mature Anti-CEA Kappa Chain Gene, pKCEAtrp207-1*

Figure 6 illustrates the construction of pKCEAtrp207-1*.

First, an intermediate plasmid pHGH207-1*, having a single trp
promoter, was prepared as follows:

The plasmid pHGH207 (described in U.S. Pat. Appl. Serial No.
307,473, filed Oct. 1, 1981 (EPO Publn. No. 0036776)) has a double
lac promoter followed by the trp promoter, flanked by EcoR I sites,
and was used to prepare pHGH207-1. pHGH207 was digested with BamH
I, followed by partial digestion with EcoR I. The largest fragment,
which contains the entire trp promoter, was isolated and ligated to
the largest EcoR I-BamH I fragment from pBR322, and the ligation
mixture used to transform E. coli 294. TetR AmpR colonies were
isolated, and most of them contained pHGH207-1. pHGH207-1*, which
lacks the EcoR I site between the ampR gene and the trp promoter,
was obtained by partial digestion of pHGH207-1 with EcoR I, filling
in the ends with Klenow and dNTPs, and religation.

5 µg of pHGH207-1* was digested with EcoR I, and the ends
extended to blunt ends using 12 units of DNA Polymerase I in a 50 µl
reaction containing 60 mM NaCl, 7 mM MgCl2, 7 mM Tris HCl (pH 7.4)
and 1 mM in each dNTP at 37°C for 1 hour, followed by extraction
with phenol/CHCl3 and precipitation with ethanol. The
precipitated DNA was digested with BamH I, and the large vector
fragment (fragment 1) purified using 5 percent polyacrylamide gel
electrophoresis, electroelution, phenol/CHCl3 extraction and
ethanol precipitation.

The DNA was resuspended in 50 µl of 10 mM Tris pH 8, 1 mM EDTA
and treated with 500 units Bacterial Alkaline Phosphatase (BAP) for
30 minutes at 65°C followed by phenol/CHCl3 extraction and ethanol
precipitation.

A DNA fragment containing part of the light chain sequence was
prepared as follows: 7 µg of pK17G4 DNA was digested with Pst I and
the kappa chain containing cDNA insert was isolated by 6 percent gel
electrophoresis and electroelution. After phenol/CHCl3
extraction, ethanol precipitation and resuspension in water, this
fragment was digested with Ava II. The 333 bp Pst I-Ava II DNA
fragment was isolated and purified from a 6 percent polyacrylamide
gel.


A 15 nucleotide DNA primer was synthesized by the
phosphotriester method G.O. 2,644,432 (supra) and has the following
sequence:

     Met Asp Ile Val Met
  5' ATG GAC ATT GTT ATG 3'
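
The correspondence between the primer and the residues written above it can be checked by translating the 15 nucleotides with the standard genetic code. The sketch below is illustrative only; the table construction and function name are assumptions, not part of the described method.

```python
# Sketch: translate the synthetic primer with the standard genetic code.
from itertools import product

BASES = "TCAG"
# Standard code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna: str) -> str:
    """Translate a DNA sequence (5'->3', whole codons) to one-letter residues."""
    dna = dna.upper().replace(" ", "")
    if len(dna) % 3 != 0:
        raise ValueError("sequence is not a whole number of codons")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

kappa_primer = "ATG GAC ATT GTT ATG"  # primer sequence given in the text
kappa_primer_peptide = translate(kappa_primer)
print(kappa_primer_peptide)
```

The one-letter result reads Met, Asp, Ile, Val, Met, with the leading ATG supplying the initiation codon.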

The 5' methionine serves as the initiation codon. 500 ng of
this primer was phosphorylated at the 5' end with 10 units T4 DNA
kinase in 20 µl reaction containing 0.5 mM ATP. ~200 ng of the Pst
I-Ava II DNA fragment was mixed with the 20 µl of the phosphorylated
primer, heated to 95°C for 3 minutes and quick frozen in a dry-ice
ethanol bath. The denatured DNA solution was made 60 mM NaCl, 7 mM
MgCl2, 7 mM Tris HCl (pH 7.4), 12 mM in each dNTP and 12 units DNA
Polymerase I-Large Fragment was added. After 2 hours incubation at
37°C this primer repair reaction was phenol/CHCl3 extracted,
ethanol precipitated, and digested to completion with Sau 3A. The
reaction mixture was then electrophoresed on a 6 percent
polyacrylamide gel and ~50 ng of the 182 basepair amino-terminal
blunt-end to Sau 3A fragment (fragment 2) was obtained after
electroelution.

100 ng of fragment 1 (supra) and 50 ng of fragment 2 were
combined in 20 µl of 20 mM Tris HCl (pH 7.5), 10 mM MgCl2, 10 mM
DTT, 2.5 mM ATP and 1 unit of T4 DNA ligase. After overnight
ligation at 14°C the reaction was transformed into E. coli K12
strain 294. Restriction endonuclease digestion of plasmid DNA from
a number of ampicillin resistant transformants indicated the proper
construction, and DNA sequence analysis proved the desired nucleotide
sequence through the initiation codon of this new plasmid, pKCEAInt1
(Figure 6).

The remainder of the coding sequence of the kappa light chain
gene was prepared as follows:

The Pst I cDNA insert fragment from 7 µg of pK17G4 DNA was
partially digested with Ava II and the Ava II cohesive ends were
extended to blunt ends in a DNA Polymerase I large fragment
reaction. Following 6 percent polyacrylamide gel electrophoresis
the 686 basepair Pst I to blunt ended Ava II DNA fragment was
isolated, purified and subjected to Hpa II restriction endonuclease
digestion. The 497 basepair Hpa II to blunt ended Ava II DNA
fragment (fragment 3) was isolated and purified after gel
electrophoresis.

10 µg of pKCEAInt1 DNA was digested with Ava I, extended with
DNA polymerase I large fragment, and digested with Xba I. Both the
large blunt ended Ava I to Xba I vector fragment and the small blunt
ended Ava I to Xba I fragment were isolated and purified from a 6
percent polyacrylamide gel after electrophoresis. The large vector
fragment (fragment 4) was treated with Bacterial Alkaline
Phosphatase (BAP), and the small fragment was digested with Hpa II,
electrophoresed on a 6 percent polyacrylamide gel and the 169 basepair
Xba I-Hpa II DNA fragment (fragment 5) was purified. ~75 ng of
fragment 4, ~50 ng of fragment 3 and ~50 ng of fragment 5 were
combined in a T4 DNA ligase reaction and incubated overnight at 14°C
and the reaction mixture transformed into E. coli K12 strain 294.
Plasmid DNA from six ampicillin resistant transformants was
analyzed by restriction endonuclease digestion. One plasmid DNA
demonstrated the proper construction and was designated pKCEAInt2.

Final construction was effected by ligating the K-CEA fragment,
including the trp promoter from pKCEAInt2, into pBR322(XAP).
(pBR322(XAP) is prepared as described in U.S. Application 452,227,
filed December 22, 1982, from pBR322 by deletion of the Ava I-Pvu II
fragment followed by ligation.)

The K-CEA fragment was prepared by treating pKCEAInt2 with
Ava I, blunt ending with DNA polymerase I (Klenow fragment) in the
presence of dNTPs, digestion with Pst I and isolation of the desired
fragment by gel electrophoresis and electroelution.

The large vector fragment from pBR322(XAP) was prepared by
successive treatment with EcoR I, blunt ending with polymerase, and
redigestion with Pst I, followed by isolation of the large vector
fragment by electrophoresis and electroelution.

The K-CEA and large vector fragments as prepared in the
preceding paragraphs were ligated with T4 DNA ligase, and the
ligation mixture transformed into E. coli as above. Plasmid DNA
from several ampicillin resistant transformants was selected for
analysis, and one plasmid DNA demonstrated the proper construction,
and was designated pKCEAtrp207-1*.

E.1.8 Construction of a Plasmid Vector for Direct Expression
of Mouse Mature Anti-CEA Heavy (Gamma 1) Chain Gene,
pγCEAtrp207-1*

Figure 7 illustrates the construction of pγCEAtrp207-1*. This
plasmid was constructed in two parts beginning with construction of
the C-terminal region of the gamma 1 gene.

5 µg of plasmid pHGH207-1* was digested with Ava I, extended to
blunt ends with DNA polymerase I large fragment (Klenow fragment),
extracted with phenol/CHCl3, and ethanol precipitated. The DNA
was digested with BamH I, treated with BAP, and the large fragment
(fragment A) was purified by 6 percent polyacrylamide gel
electrophoresis and electroelution.

~5 µg of pγ11 was digested with Pst I and the gamma chain cDNA
insert fragment containing the C-terminal portion of the gene was
purified, digested with Ava II followed by extension of the Ava II
cohesive ends with Klenow, followed by Taq I digestion. The 375
basepair blunt ended Ava II to Taq I fragment (fragment B) was
isolated and purified by gel electrophoresis and electroelution.

9 µg of pγ298 was digested with Taq I and BamH I for isolation
of the 496 basepair fragment (fragment C).

Approximately equimolar amounts of fragments A, B, and C were
ligated overnight at 14°C in 20 µl reaction mixture, then transformed
into E. coli strain 294. The plasmid DNA from six ampicillin
resistant transformants was subjected to restriction endonuclease
analysis and one plasmid DNA, named pγCEAInt1, demonstrated the
correct construction of the C-terminal portion of gamma 1 (Figure 5).

To obtain the N-terminal sequences, 30 µg of pγ298 was digested
with Pst I and the 628 basepair DNA fragment encoding the N-terminal
region of mouse anti-CEA gamma chain was isolated and purified.
This fragment was further digested with Alu I and Rsa I for
isolation of the 280 basepair fragment. A 15 nucleotide DNA primer

     Met Glu Val Met Leu
  5' ATG GAA GTG ATG CTG 3'

was synthesized by the phosphotriester method (supra).

The 5' methionine serves as the initiation codon. 500 ng of
this synthetic oligomer primer was phosphorylated at the 5' end in a
reaction with 10 units T4 DNA kinase containing 0.5 mM ATP in 20 µl
reaction mixture. ~500 ng of the 280 basepair Alu I-Rsa I DNA
fragment was mixed with the phosphorylated primer. The mixture was
heat denatured for 3 minutes at 95°C and quenched in dry-ice
ethanol. The denatured DNA solution was made 60 mM NaCl, 7 mM
MgCl2, 7 mM Tris HCl (pH 7.4), 12 mM in each dNTP and 12 units DNA
Polymerase I-Large Fragment was added. After 2 hours incubation at
37°C, this primer repair reaction was phenol/CHCl3 extracted,
ethanol precipitated, and digested to completion with Hpa II. ~50 ng
of the expected 125 basepair blunt-end to Hpa II DNA fragment
(fragment D) was purified from the gel.

A second aliquot of pγ298 DNA was digested with Pst I, the 628
basepair DNA fragment purified by polyacrylamide gel
electrophoresis, and further digested with BamH I and Hpa II. The
resulting 380 basepair fragment (fragment E) was purified by gel
electrophoresis.

~5 µg of pγCEAInt1 was digested with EcoR I, the cohesive ends
were made flush with DNA polymerase I (Klenow), further digested
with BamH I, treated with BAP and electrophoresed on a 6 percent
polyacrylamide gel. The large vector fragment (fragment F) was
isolated and purified.

In a three fragment ligation, 50 ng fragment D, 100 ng fragment
E, and 100 ng fragment F were ligated overnight at 4° in a 20 µl
reaction mixture and used to transform E. coli K12 strain 294. The
plasmid DNAs from 12 ampicillin resistant transformants were
analyzed for the correct construction and the nucleotide sequence
surrounding the initiation codon was verified to be correct for the
plasmid named pγCEAInt2.
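
For ligations such as this one it can be helpful to convert the stated fragment masses into molar amounts. The sketch below assumes an average mass of about 650 daltons per base pair of double-stranded DNA, which is a common rule of thumb and not a figure from this disclosure; the length of fragment F is not stated above, so only fragments D and E are converted.

```python
# Sketch: convert ng of a double-stranded DNA fragment to picomoles.
# AVG_DALTONS_PER_BP is an assumed rule-of-thumb value.
AVG_DALTONS_PER_BP = 650.0

def picomoles(mass_ng: float, length_bp: int) -> float:
    """Picomoles of a ds DNA fragment of length_bp present in mass_ng."""
    grams = mass_ng * 1e-9
    molar_mass = length_bp * AVG_DALTONS_PER_BP  # g/mol
    return grams / molar_mass * 1e12

fragment_d_pmol = picomoles(50, 125)    # 50 ng of the 125 bp fragment D
fragment_e_pmol = picomoles(100, 380)   # 100 ng of the 380 bp fragment E

print(f"fragment D: {fragment_d_pmol:.2f} pmol")
print(f"fragment E: {fragment_e_pmol:.2f} pmol")
```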

The expression plasmid, pγCEAtrp207-1*, used for expression of
the heavy chain gene is prepared by a 3-way ligation using the large
vector fragment from pBR322(XAP) (supra) and two fragments prepared
from pγCEAInt2.

pBR322(XAP) was treated as above by digestion with EcoR I, blunt
ending with DNA polymerase (Klenow) in the presence of dNTPs,
followed by digestion with Pst I, and isolation of the large vector
fragment by gel electrophoresis. A 1543 base pair fragment from
pγCEAInt2 containing trp promoter linked with the N-terminal coding
region of the heavy chain gene was isolated by treating pγCEAInt2
with Pst I followed by BamH I, and isolation of the desired fragment
using PAGE. The 869 base pair fragment containing the C-terminal
coding portion of the gene was prepared by partial digestion of
pγCEAInt2 with Ava I, blunt ending with Klenow, and subsequent
digestion with BamH I, followed by purification of the desired
fragment by gel electrophoresis.

The aforementioned three fragments were then ligated under
standard conditions using T4 DNA ligase, and the ligation mixture used
to transform E. coli strain 294. Plasmid DNAs from several
tetracycline resistant transformants were analyzed; one plasmid DNA
demonstrated the proper construction and was designated
pγCEAtrp207-1*.

E.1.9 Production of Immunoglobulin Chains by E. coli

E. coli strain W3110 (ATCC No. 27325) was transformed with
pγCEAtrp207-1* or pKCEAtrp207-1* using standard techniques.


To obtain double transformants, E. coli strain W3110 cells were
transformed with a modified pKCEAtrp207-1*, pKCEAtrp207-1*Δ, which
had been modified by cleaving a Pst I-Pvu I fragment from the ampR
gene and religating. Cells transformed with pKCEAtrp207-1*Δ are
thus sensitive to ampicillin but still resistant to tetracycline.
Successful transformants were retransformed using pγCEAInt2, which
confers resistance to ampicillin but not tetracycline. Cells
containing both pKCEAtrp207-1*Δ and pγCEAInt2 were thus identified by
growth in a medium containing both ampicillin and tetracycline.
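
The selection logic for double transformants can be stated compactly: a cell grows in the presence of a set of antibiotics only if the plasmids it carries together confer resistance to all of them. The following sketch encodes the resistance markers described above; the data structure and function name are illustrative assumptions.

```python
# Sketch: which transformants survive a given antibiotic combination.
# Resistance markers follow the text; names are illustrative.
RESISTANCE = {
    "pKCEAtrp207-1*Δ": {"tetracycline"},  # ampR gene disrupted
    "pγCEAInt2": {"ampicillin"},          # no tetracycline resistance
}

def grows(plasmids, antibiotics) -> bool:
    """True if the plasmids jointly confer resistance to every antibiotic."""
    conferred = set().union(*(RESISTANCE[p] for p in plasmids))
    return set(antibiotics) <= conferred

medium = ["ampicillin", "tetracycline"]
light_only = grows(["pKCEAtrp207-1*Δ"], medium)
heavy_only = grows(["pγCEAInt2"], medium)
double = grows(["pKCEAtrp207-1*Δ", "pγCEAInt2"], medium)
print(light_only, heavy_only, double)
```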

To confirm the production of heavy and/or light chains in the
transformed cells, the cell samples were inoculated into M9
tryptophan free medium containing 10 µg/ml tetracycline, and induced
with indoleacrylic acid (IAA) when the OD 550 reads 0.5. The
induced cells were grown at 37°C during various time periods and
then spun down, and suspended in TE buffer containing 2 percent SDS
and 0.1 M β-mercaptoethanol and boiled for 5 minutes. A 10 x volume
of acetone was added and the cells kept at 22°C for 10 minutes, then
centrifuged at 12,000 rpm. The precipitate was suspended in
O'Farrell SDS sample buffer (O'Farrell, P.H., J. Biol. Chem., 250:
4007 (1975)); boiled 3 minutes, recentrifuged, and fractionated
using SDS PAGE (10 percent), and stained with silver stain (Goldman,
D. et al., Science 211: 1437 (1981)); or subjected to Western blot
using rabbit anti-mouse IgG (Burnett, W. N., et al., Anal. Biochem.
112: 195 (1981)), for identification of light chain and heavy chain.


Cells transformed with pγCEAtrp207-1* showed bands upon SDS PAGE
corresponding to heavy chain molecular weight as developed by silver
stain. Cells transformed with pKCEAtrp207-1* showed the proper
molecular weight band for light chain as identified by Western blot;
double transformed cells showed bands for both heavy and light chain
molecular weight proteins when developed using rabbit anti-mouse IgG
by Western blot. These results are shown in Figures 8A, 8B, and 8C.

Figure 8A shows results developed by silver stain from cells
transformed with pγCEAtrp207-1*. Lane 1 is monoclonal anti-CEA
heavy chain (standard) from CEA.66-E3. Lanes 2b-5b are timed
samples 2 hrs, 4 hrs, 6 hrs, and 24 hrs after IAA addition. Lanes
2a-5a are corresponding untransformed controls. Lanes 2c-5c are
corresponding uninduced transformants.

Figure 8B shows results developed by Western blot from cells
transformed with pKCEAtrp207-1*. Lanes 1b-6b are extracts from
induced cells immediately, 1 hr, 3.5 hrs, 5 hrs, 8 hrs, and 24 hrs
after IAA addition, and 1a-6a corresponding uninduced controls.
Lane 7 is an extract from a pγCEAtrp207-1* control. Lanes 8, 9, and
10 are varying amounts of anti-CEA kappa chain from CEA.66-E3 cells.

Figure 8C shows results developed by Western blot from four
colonies of double transformed cells 24 hours after IAA addition
(lanes 4-7). Lanes 1-3 are varying amounts of monoclonal gamma
chain controls, lanes 8 and 9 are untransformed and pγCEAtrp207-1*
transformed cell extracts, respectively.

In another quantitative assay, frozen, transformed E. coli cells
grown according to E.1.10 (below) were lysed by heating in sodium
dodecyl sulfate (SDS)/β-mercaptoethanol cell lysis buffer at 100°C.
Aliquots were loaded on an SDS polyacrylamide gel next to lanes
loaded with various amounts of hybridoma anti-CEA. The gel was
developed by the Western blot, Burnett (supra), using 125I-labeled
sheep anti-mouse IgG antibody from New England Nuclear. The results
are shown in Figure 9. The figure shows that the E. coli products
co-migrate with the authentic hybridoma chains, indicating no
detectable proteolytic degradation in E. coli. Heavy chain from
mammalian cells is expected to be slightly heavier than E. coli
material due to glycosylation in the former. Using the hybridoma
lanes as a standard, the following estimates of heavy and light
chain production were made:

                                              (Per gram of cells)
E. coli (W3110/pγCEAtrp207-1*)                5 mg γ
E. coli (W3110/pKCEAtrp207-1*)                1.5 mg K
E. coli (W3110/pKCEAtrp207-1*Δ, pγCEAInt2)    0.5 mg K, 1.0 mg γ

E.1.10 Reconstitution of Antibody from Recombinant K and Gamma
Chains

In order to obtain heavy and light chain preparations for
reconstitution, transformed cells were grown in larger batches,
harvested and frozen. Conditions of growth of the variously
transformed cells were as follows:

E. coli (W3110/pγCEAtrp207-1*) were inoculated into 500 ml LB
medium containing 5 µg/ml tetracycline and grown on a rotary shaker
for 8 hours. The culture was then transferred to 10 liters of
fermentation medium containing yeast nutrients, salts, glucose, and
2 µg/ml tetracycline. Additional glucose was added during growth and
at OD 550 = 20, indoleacrylic acid (IAA), a trp derepressor, was added
to a concentration of 50 µg/ml. The cells were fed additional glucose
to a final OD 550 = 40, achieved approximately 6 hours from the IAA
addition.

E. coli (W3110) cells transformed with pKCEAtrp207-1* and
double transformed (with pKCEAtrp207-1*Δ and pγCEAInt2) were grown
in a manner analogous to that described above except that the OD 550
six hours after IAA addition at harvest was 25-30.

The cells were then harvested by centrifugation, and frozen.

E.2 Assay Method for Reconstituted Antibody

Anti-CEA activity was determined by ELISA as a criterion for
successful reconstitution. Wells of microtiter plates (Dynatech
Immulon) were saturated with CEA by incubating 100 µl of 2-5 µg
CEA/ml solution in 0.1M carbonate buffer, pH 9.3 for 12 hours at
room temperature. The wells were then washed 4 times with phosphate
buffered saline (PBS), and then saturated with BSA by incubating 200
µl of 0.5 percent BSA in PBS for 2 hours at 37°C, followed by
washing 4 times with PBS. Fifty microliters of each sample was
applied to each well. A standard curve (shown in Figure 10) was
run, which consisted of 50 µl samples of 10 µg, 5 µg, 1 µg, 500 ng,
100 ng, 50 ng, 10 ng, 5 ng and 1 ng anti-CEA/ml in 0.5 percent BSA
in PBS, plus 50 µl of 0.5 percent BSA in PBS alone as a blank. All
of the samples were incubated in the plate for 90 minutes at 37°C.



The plates were then washed 4 times with PBS, and sheep
anti-mouse IgG-alkaline phosphatase (TAGO, Inc.) was applied to each
well by adding 100 µl of an enzyme concentration of 24 units/ml in
0.5 percent BSA in PBS. The solution was incubated at 37°C for 90
minutes. The plates were washed 4 times with PBS before adding the
substrate, 100 µl of a 0.4 mg/ml solution of p-nitrophenylphosphate
(Sigma) in ethanolamine buffered saline, pH 9.5. The substrate was
incubated 90 minutes at 37°C for color development.


The absorbance of each well was read by the Microelisa Auto Reader
(Dynatech) set to a threshold of 1.5, calibration of 1.0 and the 0.5
percent BSA in PBS (Blank) well set to 0.000. The absorbance data was
tabulated in RS-1 on the VAX system, and the standard curve data
fitted to a four-parameter logistic model. The unknown samples'
concentrations were calculated based on the absorbance data.
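
A four-parameter logistic curve relates absorbance to concentration through two asymptotes, a midpoint and a slope, and unknowns are read off by inverting it. The sketch below shows one common parameterization; the parameter values are hypothetical placeholders, not fitted values from Figure 10, and the fitting step itself (performed in RS-1 above) is not reproduced.

```python
# Sketch: one common form of the four-parameter logistic (4PL) model.
#   a = response at zero dose, d = response at infinite dose,
#   c = concentration at the midpoint, b = slope factor.
# Parameter values below are hypothetical, not taken from the disclosure.
def four_pl(x: float, a: float, b: float, c: float, d: float) -> float:
    """Predicted absorbance at concentration x."""
    return d + (a - d) / (1.0 + (x / c) ** b)

def inverse_four_pl(y: float, a: float, b: float, c: float, d: float) -> float:
    """Concentration giving absorbance y (y strictly between a and d)."""
    return c * ((a - d) / (y - d) - 1.0) ** (1.0 / b)

PARAMS = dict(a=0.02, b=1.2, c=100.0, d=1.8)  # hypothetical fit

absorbance = four_pl(50.0, **PARAMS)           # a 50 ng/ml standard
recovered = inverse_four_pl(absorbance, **PARAMS)
print(f"A = {absorbance:.3f}, back-calculated {recovered:.1f} ng/ml")
```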

E.3 Reconstitution of Recombinant Antibody and Assay 

Frozen cells prepared as described in paragraph E.1.10 were
thawed in cold lysis buffer [10 mM Tris HCl, pH 7.5, 1 mM EDTA, 0.1M
NaCl, 1 mM phenylmethylsulfonyl fluoride (PMSF)] and lysed by
sonication. The lysate was partially clarified by centrifugation
for 20 mins at 3,000 rpm. The supernatant was protected from
proteolytic enzymes by an additional 1 mM PMSF, and used immediately
or stored frozen at -80°C; frozen lysates were never thawed more
than once.

The S-sulfonate of E. coli produced anti-CEA heavy chain (γ) was
prepared as follows: Recombinant E. coli cells transformed with
pγCEAtrp207-1*, which contained heavy chain as insoluble bodies, were
lysed and centrifuged as above; the pellet was resuspended in the
same buffer, sonicated and re-centrifuged. This pellet was washed
once with buffer, then suspended in 6M guanidine HCl, 0.1M Tris HCl,
pH 8, 1 mM EDTA, 20 mg/ml sodium sulfite and 10 mg/ml sodium
tetrathionate and allowed to react at 25°C for about 16 hrs. The
reaction mixture was dialyzed against 8M urea, 0.1M Tris HCl, pH 8,
and stored at 4°C, to give a 3 mg/ml solution of γ-SSO3.

650 µl of cell lysate from cells of various E. coli strains
producing various IgG chains was added to 500 mg urea. To this was
added β-mercaptoethanol to 20 mM, Tris-HCl, pH 8.5 to 50 mM and EDTA
to 1 mM, and in some experiments, γ-SSO3 was added to 0.1 mg/ml.
After standing at 25°C for 30-90 mins., the reaction mixtures were
dialyzed at 4°C against a buffer composed of 0.1M sodium glycinate,
pH 10.8, 0.5M urea, 10 mM glycine ethyl ester, 5 mM reduced
glutathione, 0.1 mM oxidized glutathione. This buffer was prepared
from N2-saturated water and the dialysis was performed in a capped
Wheaton bottle. After 16-48 hours, dialysis bags were transferred
to 4°C phosphate buffered saline containing 1 mM PMSF and dialysis
continued another 16-24 hrs. Dialysates were assayed by ELISA as
described in paragraph E.2 for ability to bind CEA. The results
below show the values obtained by comparison with the standard curve
in ng/ml anti-CEA. Also shown are the reconstitution efficiencies
calculated from the ELISA responses, minus the background (108
ng/ml) of cells producing K chain only, and from estimates of the
levels of γ and K chains in the reaction mixtures.

                                               ng/ml      Percent
                                               anti-CEA   recombination

E. coli W3110 producing IFN-αA (control)       0          -
E. coli (W3110/pKCEAtrp207-1*)                 108        -
E. coli (W3110/pKCEAtrp207-1*), plus γ-SSO3    848        0.33
E. coli (W3110/pKCEAtrp207-1*Δ, pγCEAInt2)     1580       0.76
Hybridoma anti-CEA K-SSO3 and γ-SSO3           540        0.40
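
One plausible way to organize an efficiency calculation of the kind described is to subtract the background ELISA response and divide by the amount of antibody that could form if the limiting chain were fully incorporated. The sketch below uses the chain molecular weights given in E.1.5 and E.1.6 and assumes one light chain per heavy chain; the chain concentrations passed in are hypothetical inputs, since the estimates actually used for the table are not stated.

```python
# Sketch of one possible efficiency calculation; an assumption, not the
# method actually used for the table. Molecular weights are from the text.
HEAVY_MW = 52258.0   # mature unglycosylated gamma 1 chain
LIGHT_MW = 24553.0   # mature unglycosylated kappa chain

def max_antibody(heavy_ng_per_ml: float, light_ng_per_ml: float) -> float:
    """ng/ml of antibody if the limiting chain were fully paired 1:1."""
    limiting = min(heavy_ng_per_ml / HEAVY_MW, light_ng_per_ml / LIGHT_MW)
    return limiting * (HEAVY_MW + LIGHT_MW)

def percent_recombination(elisa: float, background: float,
                          heavy_ng_per_ml: float,
                          light_ng_per_ml: float) -> float:
    """Background-corrected ELISA response as a percent of the maximum."""
    return 100.0 * (elisa - background) / max_antibody(heavy_ng_per_ml,
                                                       light_ng_per_ml)

net_response = 848 - 108  # background-corrected ng/ml for the third row
# Hypothetical chain levels chosen only to exercise the function:
example = percent_recombination(848, 108, HEAVY_MW, LIGHT_MW)
print(net_response, round(example, 3))
```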

E.4 Preparation of Chimeric Antibody

Figures 11 and 12 show the construction of an expression vector
for a chimeric heavy (gamma) chain which comprises the murine
anti-CEA variable region and human γ-2 constant region.

A DNA sequence encoding the human gamma-2 heavy chain is
prepared as follows: the cDNA library obtained by standard
techniques from a human multiple myeloma cell line is probed with
5' GGGCACTCGACACAA 3' to obtain the plasmid containing the cDNA
insert for human gamma-2 chain (Takahashi, et al., Cell, 29: 671
(1982), incorporated herein by reference), and analyzed to verify
its identity with the known sequence in human gamma-2 (Ellison, J.,
et al., Proc. Natl. Acad. Sci. (USA), 79: 1984 (1982), incorporated
herein by reference).

As shown in Figure 11, two fragments are obtained from this
cloned human gamma 2 plasmid (pγ2). The first fragment is formed by
digestion with Pvu II followed by digestion with Ava III, and
purification of the smaller DNA fragment, which contains a portion
of the constant region, using 6 percent PAGE. The second fragment
is obtained by digesting the pγ2 with any restriction enzyme which
cleaves in the 3' untranslated region of γ2, as deduced from the
nucleotide sequence, filling in with Klenow and dNTPs, cleaving with
Ava III, and isolating the smaller fragment using 6 percent PAGE.
(The choice of a two step, two fragment composition to supply the
Pvu II-3' untranslated fragment provides a cleaner path to product
due to the proximity of the Ava III site to the 3' terminal end, thus
avoiding additional restriction sites in the gene sequence matching
the 3' untranslated region site.) pγCEA207-1* is digested with EcoR
I, treated with Klenow and dNTPs to fill in the cohesive end, and
digested with Pvu II, and the large vector fragment containing
promoter is isolated by 6 percent PAGE.

The location and DNA sequence surrounding the Pvu II site in the 
mouse gamma-1 gene are identical to the location and DNA sequence 
surrounding the Pvu II site in the human gamma-2 gene. 

The plasmid resulting from a three way ligation of the foregoing 
fragments, pChim1, contains, under the influence of trp promoter, 
the variable and part of the constant region of murine anti-CEA 
gamma 1 chain, and a portion of the gamma 2 human chain. pChim1 
will, in fact, express a chimeric heavy chain when transformed into 
E. coli, but one wherein the change from mouse to human does not 
take place at the variable to constant junction. 
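
The role of the shared Pvu II site in joining mouse and human sequence can be sketched in a few lines of Python. This is a toy model only: the two input sequences are hypothetical placeholders, not the actual gamma-1 or gamma-2 genes, and the function names are invented for illustration. The one fact taken from outside the text is that Pvu II recognizes CAGCTG and cuts between CAG and CTG to leave blunt ends.

```python
PVU_II_SITE = "CAGCTG"  # Pvu II cuts blunt between CAG and CTG


def cut_at_pvu_ii(seq: str) -> tuple[str, str]:
    """Split a sequence at its first Pvu II site, returning (5' part, 3' part)."""
    index = seq.find(PVU_II_SITE)
    if index < 0:
        raise ValueError("no Pvu II site found")
    cut = index + 3  # blunt cut after CAG
    return seq[:cut], seq[cut:]


def join_at_pvu_ii(mouse: str, human: str) -> str:
    """Join the mouse 5' part to the human 3' part at the shared Pvu II site."""
    mouse_5prime, _ = cut_at_pvu_ii(mouse)
    _, human_3prime = cut_at_pvu_ii(human)
    return mouse_5prime + human_3prime


if __name__ == "__main__":
    # Hypothetical toy sequences with the site at the same position in each.
    mouse = "ATGGAAGTGCAGCTGAAACCC"
    human = "ATGCCCTTTCAGCTGGGGTTT"
    print(join_at_pvu_ii(mouse, human))
```

Because the site sits at the same position in both toy sequences, the joined product has the same length and reading frame as either parent, and the Pvu II site is regenerated at the junction.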

Figure 12 shows modification of pChim1 to construct pChim2 so 
that the resulting protein from expression will contain variable 
region from murine anti-CEA antibody and constant region from the 
human γ-2 chain. First, a fragment is prepared from pChim1 by 
treating with Nco I, blunt ending with Klenow and dNTPs, cleaving 
with Pvu II, and isolating the large vector fragment which is almost 
the complete plasmid except for a short segment in the constant 
coding region for mouse anti-CEA. A second fragment is prepared from 
the previously described pγ2 by treating with Pvu II, followed by 
treating with any restriction enzyme which cleaves in the variable 
region, blunt ending with Klenow and dNTPs and isolating the short 
fragment which comprises the junction between variable and constant 
regions of this chain. 

Ligation of the foregoing two fragments produces an intermediate 
plasmid which is correct except for an extraneous DNA fragment which 
contains a small portion of the constant region of the murine 
anti-CEA antibody, and a small portion of the variable region of the 
human gamma chain. This repair can be made by excising the Xba I to 
Pvu II fragment and cloning into M13 phage as described by Messing 
et al., Nucleic Acids Res. 9: 309 (1981), followed by in vitro site 
directed deletion mutagenesis as described by Adelman, et al., DNA 
2, 183 (1983), which is incorporated herein by reference. The 
Xba I-Pvu II fragment thus modified is ligated back into the 
intermediate plasmid to form pChim2. This plasmid then is capable 
of expressing in a suitable host a cleanly constructed murine 
variable/human constant chimeric heavy chain. 

In an analogous fashion, but using mRNA templates for cDNA 
construction for human kappa rather than γ chain, the expression 
plasmid for chimeric light chain is prepared. 

The foregoing two plasmids are then double transformed into 
E. coli W3110, the cells grown and the chains reconstituted as set 
forth in paragraph E.1-E.3 supra. 

E.5 Preparation of Altered Murine Anti-CEA Antibody 

E.5.1 Construction of Plasmid Vectors for Direct Expression of 
Altered Murine Anti-CEA Heavy Chain Gene 

The cysteine residues, and the resultant disulfide bonds in the 
region of amino acids 216-230 in the constant region of murine 
anti-CEA heavy chain are suspected to be important for complement 
fixation (Klein, et al., Proc. Natl. Acad. Sci. (USA), 78: 524 
(1981)) but not for the antigen binding property of the resulting 
antibody. To decrease the probability of incorrect disulfide bond 
formation during reconstitution according to the process of the 
invention herein, the nucleotides encoding the amino acid residues 
226-232, which include codons for three cysteines, are deleted as 
follows: 

A "deleter" deoxyoligonucleotide, 5' CTAACACCATGTCAGGGT, is used 
to delete the relevant portions of the gene from pγCEAtrp207-1* by 
the procedure of Wallace, et al., Science, 209: 1396 (1980) or of 
Adelman, et al., DNA 2, 183 (1983). Briefly, the "deleter" 
deoxyoligonucleotide is annealed with denatured pγCEAtrp207-1* DNA, 
and primer repair synthesis carried out in vitro, followed by 
screening by hybridization of presumptive deletion clones with 
32P-labelled deleter sequence. 
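
The logic of a "deleter" oligonucleotide can be sketched as follows. This is a toy model under stated assumptions, not the actual construction: the template is a hypothetical sequence, the 21-base segment removed stands in for seven codons, the arm length of 9 bases per side is chosen only so that the resulting oligonucleotide is an 18-mer like the one in the text, and the function names are invented for illustration.

```python
def make_deleter(template: str, start: int, end: int, arm: int = 9) -> str:
    """Join `arm` bases upstream of template[start:end] to `arm` bases downstream."""
    if start < arm or end + arm > len(template):
        raise ValueError("not enough flanking sequence for the requested arm length")
    return template[start - arm:start] + template[end:end + arm]


def apply_deletion(template: str, start: int, end: int) -> str:
    """Return the template with template[start:end] removed."""
    return template[:start] + template[end:]


if __name__ == "__main__":
    # Hypothetical template: flank + seven codons to delete + flank.
    left = "GGATCCAAGCTTACG"
    deleted = "TGTAAACCGTGCATTTGTACC"  # 7 codons, three of them cysteine (TGT/TGC)
    right = "CCATGGAATTCGGTA"
    template = left + deleted + right

    start, end = len(left), len(left) + len(deleted)
    deleter = make_deleter(template, start, end)
    product = apply_deletion(template, start, end)

    print("deleter        :", deleter)
    print("in template?   :", deleter in template)
    print("in deletion product?:", deleter in product)
```

The oligonucleotide matches the deletion product contiguously but matches the undeleted template only in two separated halves, which is why a labelled copy of the same sequence can distinguish presumptive deletion clones from the parent in a hybridization screen.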

E.5.2 Production of Cysteine Deficient Altered Antibody 

The plasmid prepared in E.5.1 is transformed into an E. coli 
strain previously transformed with pKCEAtrp207-1* as described 
above. The cells are grown, extracted for recombinant antibody 
chains, and the altered antibody reconstituted as described in 
E.1.10. 

E.6 Preparation of Fab 


E.6.1 Construction of a Plasmid Vector for Direct Expression 
of Murine Anti-CEA Gamma 1 Fab Fragment Gene 
pγCEAFabtrp207-1* 

Figure 13 presents the construction of pγCEAFabtrp207-1*. 5 μg 
of pBR322 was digested with Hind III, the cohesive ends made flush 
by treating with Klenow and dNTPs, digested with Pst I, and treated 
with BAP. The large vector fragment, fragment I, was recovered 
using 6 percent PAGE followed by electroelution. 

5 μg of pγCEAtrp207-1* was digested with both BamH I and Pst I 
and the ~1570 bp DNA fragment (fragment II) containing the trp 
promoter and the gene sequence encoding the variable region 
continuing into constant region and further into the anti-CEA gamma 
1 chain hinge region, was isolated and purified after 
electrophoresis. 

Expression of the anti-CEA gamma 1 chain Fab fragment rather 
than complete heavy chain requires that a termination codon be 
constructed at the appropriate location in the gene. For this, the 
260 bp Nco I - Nde I DNA fragment from 20 μg of the pγ298 was 
isolated and purified. A 13 nucleotide DNA primer, the complement 
of which encodes the last 3 C-terminal amino acids of the Fab gene 
and 2 bases of the 3 needed for the stop codon, was synthesized by 
the phosphotriester method (supra). The probe hybridizes to 
nucleotides 754 to 767 (Figure 4) which has the following sequence: 

     AspCysGlyStop 
5' GGGATTGTGGTTG 3' 

The third base of the stop codon is provided by the terminal 
nucleotide of the filled-in Hind III site from pBR322 cleavage 
described above. 500 ng of this primer was used in a primer repair 
reaction by phosphorylation at the 5' end in a reaction with 10 
units T4 DNA kinase containing 0.5 mM ATP in 20 μl, and mixing with 
~200 ng of the Nco I-Nde I DNA fragment. The mixture was heat 
denatured for 3 minutes at 95°C and quenched in dry-ice ethanol. The 
denatured DNA solution was made 60 mM NaCl, 7 mM MgCl2, 7 mM Tris 
HCl (pH 7.4), 12 mM in each dNTP and 12 units DNA Polymerase I-Large 
Fragment was added. After 2 hours incubation at 37°C, this primer 
repair reaction was phenol/CHCl3 extracted, ethanol precipitated, 
digested with BamH I and the reaction electrophoresed through a 6 
percent polyacrylamide gel. ~50 ng of the 181 bp blunt end to BamH 
I DNA fragment, fragment III, was isolated and purified. 

~100 ng of fragment I, ~100 ng each of fragments II and III were 
ligated overnight and transformed into E. coli K12 strain 294. 
Plasmid DNA from several tetracycline resistant transformants was 
analyzed for the proper construction and the nucleotide sequence 
through the repair blunt end filled-in Hind III junction was 
determined for verification of the TGA stop codon. 
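
How the stop codon arises at the blunt-end junction described above can be checked with a short Python sketch. Two assumptions are made and should be read as such: first, that the reading frame of the 13-base sequence shown is the one implied by the "AspCysGlyStop" label (codons starting at the third base); second, that the filled-in Hind III end of the vector reads 5' AGCTT, which follows from Hind III cutting AAGCTT after the first A. The function name `translate` and the constant names are introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

PRIMER_REGION = "GGGATTGTGGTTG"   # 13-base sequence shown in the text
HIND_III_FILLED_END = "AGCTT"     # assumed blunt end after fill-in of A^AGCTT


def translate(seq: str) -> str:
    """Translate complete codons from the start of seq, stopping after a stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


if __name__ == "__main__":
    frame_offset = 2  # assumed frame: GG | GAT TGT GGT TG...
    junction = PRIMER_REGION + HIND_III_FILLED_END
    print("before ligation:", translate(PRIMER_REGION[frame_offset:]))
    print("after ligation :", translate(junction[frame_offset:]))
```

Before ligation the frame ends on an incomplete TG; after joining to the filled-in Hind III end the terminal A completes TGA, so translation reads Asp-Cys-Gly followed by a stop.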

E.6.2 Production of Fab Protein 

The plasmid prepared in E.6.1 is transformed into an E. coli 
strain previously transformed with pKCEAtrp207-1* as described 
above. The cells are grown, extracted for recombinant antibody 
chains and the Fab protein reconstituted as described in E.1.10. 

The appended claims set out the principal areas for 
which a monopoly is presently claimed. In addition, the 
following preferred features should be noted: 

the antibody of claim 3 which is directed against CEA; 

the antibody of claim 3 wherein the heavy chain is of 
the gamma family; 

the antibody of claim 3 wherein the light chain is of 
the kappa family; 

the composition of matter of claim 8 which is 
mammalian; 

the composition of matter of claim 8 which is 
immunoreactive against CEA; 

the sequence of claim 9 which is a mammalian heavy 
chain; 

the sequence of claim 9 which is anti-CEA heavy chain; 

the sequence of claim 10 which is a mammalian light 
chain; 

the sequence of claim 10 which is anti-CEA light 
chain; 

the recombinant host cells of claim 16 which are 
microbial host cells; 

the method of claim 17 wherein the vector of b) and 
the vector of d) are transformed into the same host cell 
culture, and the sequence of a) and the sequence of c) are 
inserted into the same replicable expression vector; 

the method of claim 17 wherein the DNA sequence of a) 
encodes mammalian heavy chain, and the DNA sequence of c) 
encodes mammalian light chain; and wherein both DNA 
fragments encode amino acid sequences of the same mammalian 
antibody; 

the method of claim 17 wherein the DNA fragment of a) 
encodes a chimeric hybrid heavy chain and the DNA sequence 
of c) encodes a chimeric light chain; and 

the method of any one of claims 17 to 19 wherein said 
vectors are transformed into the same host cell culture. 




CLAIMS 

1. An immunoglobulin produced by recombinant host cells. 

2. An immunoglobulin substantially free of other proteins 
with which it is normally associated in vertebrate cells. 

3. The immunoglobulin of claim 1 or 2 which is a 
mammalian antibody, in that the amino acid sequences of all 
four chains are homologous to the sequences in the 
corresponding chains in an antibody derived from a 
mammalian species. 

4. The immunoglobulin of claim 1 or 2 which is a hybrid 
antibody, a composite non-specific immunoglobulin, a 
chimeric antibody, or an altered antibody. 

5. A chimeric antibody of claim 4 wherein the constant 
regions of all four chains are homologous to the 
corresponding constant regions of an antibody of a first 
mammalian species, and the amino acid sequence of the 
variable regions of all four chains are homologous to the 
variable regions in an antibody derived from a second, 
different, mammalian species. 

6. A composition of matter consisting essentially of a 
univalent antibody. 

7. A composition of matter consisting essentially of Fab 
protein. 

8. A composition of matter of claim 6 or claim 7 which 
is produced by recombinant host cells. 


9. A sequence of amino acids produced by recombinant 
host cells corresponding to immunoglobulin heavy chain. 

10. A sequence of amino acids produced by recombinant host 
cells corresponding to immunoglobulin light chain. 

11. A sequence of claim 9 or claim 10 which is a chimeric 
heavy chain or light chain, respectively. 

12. A sequence of claim 11 wherein that portion of the 
sequence which corresponds to the constant region is 
homologous to corresponding sequence of an antibody derived 
from humans, and the amino acid sequence of the variable 
region is homologous to the corresponding amino acid 
sequence of an antibody derived from non-human mammalian 
species. 

13. A DNA sequence which encodes for the immunoglobulin 
of claim 1 or 2, the composition of matter of claim 6 or 
the amino acid sequence of claim 9 or claim 10. 

14. A replicable expression vector capable of expressing 
in a suitable host cell the DNA sequence of claim 13. 

15. An expression plasmid which comprises the DNA sequence 
of claim 14 operably linked to a promoter compatible with a 
suitable host cell. 

16. Recombinant host cells or host cell cultures 
transformed with the vector of claim 14 or 15. 


17. A method for preparing immunoglobulins in recombinant 
host cells which method comprises 

a) preparing a DNA sequence encoding heavy chain, 

b) inserting the sequence of a) into a replicable 
expression vector operably linked to a suitable promoter, 

c) preparing a DNA sequence encoding light chain, 

d) inserting the sequence of c) into a replicable 
expression vector operably linked to a suitable promoter, 

e) transforming host cell culture with the vector of 
b) and host cell culture with the vector of d), 

f) recovering light chain and heavy chain from cell 
culture, 

g) reconstituting light and heavy chain, 

wherein steps f) and g) may be performed either 
sequentially in either order, or simultaneously. 

18. A method for preparing Fab protein in recombinant 
host cells which method comprises 

a) preparing a DNA sequence encoding the Fab region 
of heavy chain, 

b) inserting the sequence of a) into a replicable 
expression vector operably linked to a suitable promoter, 

c) preparing a DNA sequence encoding light chain, 

d) inserting the sequence of c) into a replicable 
expression vector operably linked to a suitable promoter, 

e) transforming host cell culture with the vector of 
b) and host cell culture with the vector of d), 

f) recovering light chain and Fab protein of heavy 
chain from cell culture, 

g) reconstituting light and heavy Fab region chains; 

wherein steps f) and g) may either be performed 
sequentially in either order or simultaneously. 


19. A method for preparing univalent antibody in 
recombinant host cells which method comprises 

a) preparing a DNA sequence encoding heavy chain, 

b) inserting the sequence of a) into a replicable 
expression vector operably linked to a suitable promoter, 

c) preparing a DNA sequence encoding light chain, 

d) inserting the sequence of c) into a replicable 
expression vector operably linked to a suitable promoter, 

e) preparing a DNA sequence encoding the Fc portion 
of heavy chain, 

f) inserting the sequence of e) into a replicable 
expression vector operably linked to a suitable promoter, 

g) transforming host cell culture with the vector of 
b), host cell culture with the vector of d), and host cell 
culture with the vector of f), 

h) recovering light chain, heavy chain, and Fc portion 
of heavy chain from cell culture, 

i) reconstituting light chain, heavy chain, and Fc 
portion of heavy chain, 

wherein steps h) and i) may be performed 
sequentially in either order or simultaneously. 

20. A method for preparing heavy chain or light chain 
which method comprises 

a) preparing a DNA sequence encoding heavy or light 
chain, 

b) inserting said sequence into a replicable 
expression vector operably linked to a suitable promoter, 

c) transforming host cell culture with the vector of 
b), and 

d) recovering heavy or light chain from cell culture. 

21. A method for preparing Fab region of heavy chain as a 
polypeptide which method comprises 

a) preparing a DNA sequence encoding Fab region of 
heavy chain, 

b) inserting said sequence into a replicable 
expression vector operably linked to a suitable promoter, 

c) transforming host cell culture with the vector of 
b), 

d) recovering Fab region of heavy chain from cell 
culture. 





[Figure: antibody structure, with the F(ab')2 fragment, Fab' fragment and Fc fragment labelled] 

[Figures: nucleotide sequence listings with restriction sites, and an mRNA codon/amino acid sequence listing; sequence text not recoverable] 

sau96 ddeI 

hinfI avaII mnlI aluI ahaIII sfaNI 

GAGTCAGCAC TGAACACGGA CCCCTCACGA TGAACTTCGG GCTCAGCTTG ATTTACCTTG TCCTTGTTTT AAAAGTTGTC CAGTGTGAAG TGATGCTGGT 

CTCAGTCGTG ACTTGTGCCT GGGGAGTGCT ACTTGAAGCC CGAGTCGAAC TAAATGGAAC AGGAACAAAA TTTTCAACAG GTCACACTTC ACTACGACCA 



Fig. 4 A. 


fnu4HI 

scrFI bbv 

ecoRII hincII aluI taqI

1001 ACTTCCCATC ATGCACCAGG ACTGGCTCAA TGGCAAGGAG TTCAAATGCA GGGTCAACAG TGCAGCTTTC CCTGCCCCCA TCGAGAAAAC CATCTCCAAA 
TGAAGGGTAG TACGTGGTCC TGACCGAGTT ACCGTTCCTC AAGTTTACGT CCCAGTTGTC ACGTCGAAAG GGACGGGGGT AGCTCTTTTG GTAGAGGTTT 
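
The two sequence lines above appear to be the two strands of the same DNA segment, with the lower line written as the base-by-base complement of the upper line. A minimal sketch of how that pairing can be checked is given below; the names `complement`, `TOP` and `BOTTOM` are illustrative assumptions and do not come from the specification.

```python
# Watson-Crick pairing: A<->T and C<->G. Characters outside ACGT
# (here, the spaces between blocks of ten bases) are left unchanged.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Return the base-by-base complement of a DNA string (not reversed)."""
    return seq.translate(COMPLEMENT)


# Upper and lower lines of the figure, starting at position 1001.
TOP = (
    "ACTTCCCATC ATGCACCAGG ACTGGCTCAA TGGCAAGGAG TTCAAATGCA "
    "GGGTCAACAG TGCAGCTTTC CCTGCCCCCA TCGAGAAAAC CATCTCCAAA"
)
BOTTOM = (
    "TGAAGGGTAG TACGTGGTCC TGACCGAGTT ACCGTTCCTC AAGTTTACGT "
    "CCCAGTTGTC ACGTCGAAAG GGACGGGGGT AGCTCTTTTG GTAGAGGTTT"
)

if __name__ == "__main__":
    print(complement(TOP) == BOTTOM)
```

The lower strand is compared without reversal because the figure prints it directly beneath the upper strand, position for position.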


0125023 



-10 1 
met asn phe gly leu ser leu ile tyr leu val leu val leu lys val val gln cys glu
GAGUCAGCACUGAACACGGACCCCUCACG AUG AAC UUC GGG CUC AGC UUG AUU UAC CUU GUC CUG GUU UUA AAA GUU GUC CAG UGU GAA 
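
The residue line and the mRNA line above correspond codon by codon under the standard genetic code. The following is a minimal sketch of that correspondence, assuming the standard code and taking translation to begin at the AUG that follows the untranslated leader; the names `translate`, `CODON_TABLE` and `MRNA` are illustrative assumptions and do not come from the specification.

```python
# Standard genetic code, built in the conventional U, C, A, G order.
BASES = "UCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, ONE_LETTER))

# One-letter to three-letter residue names, lower case as in the figure.
THREE_LETTER = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
    "*": "stop",
}


def translate(mrna: str) -> list[str]:
    """Translate space-separated RNA codons into three-letter residue names."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in mrna.split()]


# Coding portion of the mRNA line, beginning at the AUG start codon.
MRNA = (
    "AUG AAC UUC GGG CUC AGC UUG AUU UAC CUU GUC CUG GUU UUA AAA "
    "GUU GUC CAG UGU GAA"
)

if __name__ == "__main__":
    print(" ".join(translate(MRNA)))
```

Run on the codons shown, the sketch yields the same twenty residues as the line printed above the mRNA.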


250 260 270 

ile phe pro pro lys pro lys asp val leu thr ile thr leu thr pro lys val thr cys val val val asp ile ser lys asp asp pro
AUC UUC CCC CCA AAG CCC AAG GAU GUG CUC ACC AUU ACU CUG ACU CCU AAG GUC ACG UGU GUU GUG GUA GAC AUC AGC AAG GAU GAU CCC 


Fig. 5B. 
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